RECOMBINANT BONE MORPHOGENETIC PROTEIN HETERODIMERS,
COMPOSITIONS AND METHODS OF USE



Field of the Invention 

The present invention relates to a series of 
novel recombinant heterodimeric proteins useful in the 
field of treating bone defects, healing bone injury and 
in wound healing in general. The invention also relates 
to methods for obtaining these heterodimers, methods for 
producing them by recombinant genetic engineering 
techniques, and compositions containing them. 

Background of the Invention 

In recent years, protein factors which are 
characterized by bone or cartilage growth inducing 
properties have been isolated and identified. See, e.g., 
U.S. Patent No. 5,013,649; PCT published application
WO90/11366; PCT published application WO91/05802 and the
variety of references cited therein. See, also,
PCT/US90/05903 which discloses a protein sequence termed
OP-1, which is substantially similar to human BMP-7, and 
has been reported to have osteogenic activity. 

A family of individual bone morphogenetic
proteins (BMPs), termed BMP-2 through BMP-9, has been
isolated and identified. Incorporated by reference for
the purposes of providing disclosure of these proteins
and methods of producing them are co-owned, co-pending
U.S. Patent Application SN 721,847 and the related
applications recited in its preamble. Of particular
interest are the proteins termed BMP-2 and BMP-4,
disclosed in the above-referenced application; BMP-7,
disclosed in SN 438,919; BMP-5, disclosed in SN 370,547
and SN 356,033; BMP-6, disclosed in SN 370,544 and SN
347,559; and BMP-8, disclosed in SN 525,357. Additional
members of the BMP family include BMP-1, disclosed in SN
655,578; BMP-9, disclosed in SN 720,590; and BMP-3,
disclosed in SN 179,197 and PCT publication 89/01464.
These applications are incorporated herein by reference
for disclosure of these BMPs.

There remains a need in the art for other
proteins and compositions useful in the fields of bone
and wound healing.

Summary of the Invention

In one aspect, the invention provides a method
for producing a recombinant heterodimeric protein having
bone stimulating activity comprising culturing a selected
host cell containing a polynucleotide sequence encoding a
first selected BMP or fragment thereof and a
polynucleotide sequence encoding a second selected BMP or
fragment thereof. The resulting co-expressed,
biologically active heterodimer is isolated from the
culture medium.

According to one embodiment of this invention,
the host cell may be co-transfected with one or more
vectors containing coding sequences for one or more BMPs.
Each BMP polynucleotide sequence may be present on the
same vector or on individual vectors transfected into the
cell. Alternatively, the BMPs or their fragments may be
incorporated into a chromosome of the host cell.
Additionally, a single transcription unit may encode a
single copy of each of two genes encoding different BMPs.

According to another embodiment of this 
invention, the selected host cell containing the two 
polypeptide encoding sequences is a hybrid cell line 
obtained by fusing two selected, stable host cells, each 
host cell transfected with, and capable of stably
expressing, a polynucleotide sequence encoding a selected 
first or second BMP or fragment thereof. 

In another aspect of the present invention, 
therefore, there are provided recombinant heterodimeric 
proteins comprising a protein or fragment of a first BMP 
in association with a protein or fragment of a second 
BMP. The heterodimer may be characterized by bone 
stimulating activity. The heterodimers may comprise a 
protein or fragment of BMP-2 associated with a protein or 
fragment of either BMP-5, BMP-6, BMP-7 or BMP-8; or a 
protein or fragment of BMP-4 associated with a protein or 
fragment of either BMP-5, BMP-6, BMP-7 or BMP-8. In 
further embodiments the heterodimers may comprise a 
protein or fragment of BMP-2 associated with a protein or 
fragment of either BMP-1, BMP-3 or BMP-4. BMP-4 may also
form a heterodimer in association with BMP-1, BMP-2 or a
fragment thereof. Still further embodiments may comprise
heterodimers involving combinations of BMP-5, BMP-6,
BMP-7 and BMP-8. For example, the heterodimers may comprise
BMP-5 associated with BMP-6, BMP-7 or BMP-8; BMP-6
associated with BMP-7 or BMP-8; or BMP-7 associated with
BMP-8. These heterodimers may be produced by
co-expressing each protein in a selected host cell and
isolating the heterodimer from the culture medium.
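
As an illustrative aside, the pairings named in this Summary can be enumerated mechanically. The Python sketch below is not part of the disclosed method; the names `pair` and `pairs` are hypothetical, and the partner lists are simply transcribed from the preceding paragraph. Pairings are treated as unordered, so BMP-2/BMP-4 and BMP-4/BMP-2 are counted once.

```python
from itertools import combinations

def pair(a, b):
    """Return an unordered BMP pairing, e.g. pair(2, 7) == pair(7, 2)."""
    return frozenset((a, b))

# Partner lists transcribed from the Summary text (BMP numbers only).
named_partners = {
    2: (5, 6, 7, 8, 1, 3, 4),  # BMP-2 with BMP-5..8, BMP-1, BMP-3, BMP-4
    4: (5, 6, 7, 8, 1, 2),     # BMP-4 with BMP-5..8, BMP-1, BMP-2
}

pairs = set()
for first, partners in named_partners.items():
    for partner in partners:
        pairs.add(pair(first, partner))

# "Combinations of BMP-5, BMP-6, BMP-7 and BMP-8" taken two at a time.
pairs |= {pair(a, b) for a, b in combinations((5, 6, 7, 8), 2)}

for p in sorted(pairs, key=sorted):
    low, high = sorted(p)
    print(f"BMP-{low}/BMP-{high}")
```

Under these assumptions the set holds 18 distinct pairings; combinations not named in the text, such as BMP-3 with BMP-4, are deliberately absent.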

As a further aspect of this invention a cell
line is provided which comprises a first polynucleotide
sequence encoding a first BMP or fragment thereof and a
second polynucleotide sequence encoding a second BMP or
fragment thereof, the sequences being under control of
one or more suitable expression regulatory systems
capable of co-expressing the BMPs as a heterodimer. The
cell line may be transfected with one or more than one
polynucleotide molecule. Alternatively, the cell line
may be a hybrid cell line created by cell fusion as
described above.

Another aspect of the invention is a
polynucleotide molecule or plasmid vector comprising a
polynucleotide sequence encoding a first selected BMP or
fragment thereof and a polynucleotide sequence encoding a
second selected BMP or fragment thereof. The sequences
are under the control of at least one suitable regulatory
sequence capable of directing co-expression of each
protein or fragment. The molecule may contain a single
transcription unit containing a copy of both genes, or
more than one transcription unit, each containing a copy
of a single gene.

As still another aspect of this invention there 
is provided a method for producing a recombinant dimeric 
or heterodimeric protein having bone stimulating activity 
in a prokaryotic cell comprising culturing a selected 
host cell containing a polynucleotide sequence encoding a 
first selected BMP or fragment thereof; culturing a 
second selected host cell containing a polynucleotide 
sequence encoding a second selected BMP or fragment 
thereof; isolating monomeric forms of each BMP protein 
from the culture medium and co-assembling a monomer of 
the first protein with a monomer of the second protein. 
The first protein and the second protein may be the same 
or different BMPs. The resulting biologically active 
dimer or heterodimer is thereafter isolated from the 
mixture. Preferred cells are E. coli.

Thus, as further aspects of this invention 
recombinant BMP dimers or heterodimers produced in 
eukaryotic cells are provided, as well as suitable 
vectors or plasmids, and selected transformed cells 
useful in such a production method. 

Other aspects and advantages of the present
invention are described further in the following detailed
description of preferred embodiments of the present
invention.

Brief Description of the Figures

Figure 1 provides the DNA and amino acid
sequences of human BMP-2 (SEQ ID NOs: 1 and 2).

Figure 2 provides the DNA and amino acid
sequences of human BMP-4 (SEQ ID NOs: 3 and 4).

Figure 3 provides the DNA and amino acid
sequences of human BMP-7 (SEQ ID NOs: 5 and 6).

Figure 4 provides the DNA and amino acid
sequences of human BMP-6 (SEQ ID NOs: 7 and 8).

Figure 5 provides the DNA and amino acid
sequences of human BMP-5 (SEQ ID NOs: 9 and 10).

Figure 6 provides the DNA and amino acid
sequences of human BMP-8 (SEQ ID NOs: 11 and 12).

Figure 7 provides the DNA sequence of vector
PALB2-781 containing the mature portion of the BMP-2 gene
(SEQ ID NOs: 13 and 14).

Figure 8 compares the activity of CHO BMP-2 and
CHO BMP-2/7 in the W20 alkaline phosphatase assay.

Figure 9 compares the activity of CHO BMP-2 and
CHO BMP-2/7 in the BGP (osteocalcin) assay.

Figure 10 provides a comparison of the W-20
activity of E. coli produced BMP-2 and BMP-2/7
heterodimer.

Figure 11 depicts BMP-3 DNA and amino acid sequence.

Figure 12 provides a comparison of BMP-2 and BMP-2/6
in the W-20 assay.

Figure 13 provides a comparison of the in vivo
activity of BMP-2/6 and BMP-2.

Figure 14 provides a comparison of BMP-2, BMP-6 and
BMP-2/6 in vivo activity.

Detailed Description of the Invention 

The present invention provides a method for 
producing recombinant heterodimeric proteins having bone 
stimulating activity, as well as the recombinant 
heterodimers themselves, and compositions containing them 
for bone-stimulating or repairing therapeutic use. 

As used throughout this document, the term 
'heterodimer' is defined as a biologically-active protein 
construct comprising the association of two different BMP 
protein monomers or active fragments thereof joined 
through at least one covalent, disulfide linkage. A 
heterodimer of this invention may be characterized by the
presence of between one and seven disulfide linkages
between the two BMP component strands.

According to the present invention, therefore,
a method for producing a recombinant BMP heterodimer
comprises culturing a
selected host cell containing a polynucleotide sequence
encoding a first selected BMP or a biologically active
fragment thereof and a polynucleotide sequence encoding a
second selected BMP or a fragment thereof. The resulting
co-expressed, biologically active heterodimer is formed
within the host cell, secreted therefrom and isolated
from the culture medium. Preferred embodiments of
methods for producing the heterodimeric proteins of this
invention are described in detail below and in the
following examples. Preferred methods of the invention
involve known recombinant genetic engineering techniques
[See, e.g., Sambrook et al, "Molecular Cloning: A
Laboratory Manual", 2d edition, Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY (1989)]. However,
other methods, such as conventional chemical synthesis,
may also be useful in preparing a heterodimer of this
invention.

BMP heterodimers generated by this method are
produced in a mixture of homodimers and heterodimers.
This mixture of heterodimers and homodimers may be
separated from contaminants in the culture medium by
resort to essentially conventional methods, such as
classical protein biochemistry or affinity antibody
columns specific for one of the BMPs making up the
heterodimer. Additionally, if desired, the heterodimers
may be separated from homodimers in the mixture. Such
separation techniques allow unambiguous determination of
the activity of the heterodimeric species. Example 4
provides one presently employed purification scheme for
this purpose.

Preferably the recombinant heterodimers of this
invention produced by these methods involve the BMPs
designated human BMP-2, human BMP-4, human BMP-5, human
BMP-6, human BMP-7 and BMP-8. However, BMP-3 has also
been determined to form an active heterodimer with BMP-2.
Other species of these BMPs, as well as BMPs other than those
specifically identified above, may also be employed in
heterodimers useful for veterinary, diagnostic or
research use. However, the human proteins, specifically
those proteins identified below, are preferred for human
pharmaceutical uses.

Human BMP-2 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
1. Human BMP-2 proteins are further characterized as
disulfide-linked dimers and homodimers of mature BMP-2
subunits. Recombinantly-expressed BMP-2 subunits include
protein species having heterogeneous amino termini. One
BMP-2 subunit is characterized by comprising amino acid
#249 (Ser) - #396 (Arg) of Figure 1 (SEQ ID NOs: 1 and
2). Another BMP-2 subunit is characterized by comprising
amino acid #266 (Thr) - #396 (Arg) of Figure 1. Another
BMP-2 subunit is characterized by comprising amino acid
#296 (Cys) - #396 (Arg) of Figure 1. A mature BMP-2
subunit is characterized by comprising amino acid #283
(Gln) - #396 (Arg) of Figure 1. This latter subunit is
the presently most abundant protein species which results
from recombinant expression of BMP-2 (Figure 1).
However, the proportions of certain species of BMP-2
produced may be altered by manipulating the culture
conditions. BMP-2 may also include modifications of the
sequences of Figure 1, e.g., deletion of amino acids
#241-280 and changing amino acid #245 Arg to Ile, among
other changes.

As described in detail in United States Patent
Application SN 721,847, incorporated by reference herein,
human BMP-2 may be produced by culturing a cell
transformed with a DNA sequence comprising the nucleotide
coding sequence from nucleotide #356 to #1543 in Figure 1
and recovering and purifying from the culture medium one
or more of the above-identified protein species,
substantially free from other proteinaceous materials
with which it is co-produced. Human BMP-2 proteins are
characterized by the ability to induce bone formation.
Human BMP-2 also has in vitro activity in the W20
bioassay. Human BMP-2 is further characterized by the
ability to induce cartilage formation. Human BMP-2 may
be further characterized by the ability to demonstrate
cartilage and/or bone formation activity in the rat bone
formation assay described in the above-referenced
application.

Human BMP-4 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
2 (SEQ ID NOs: 3 and 4). Human BMP-4 proteins are
further characterized as disulfide-linked dimers and
homodimers of mature BMP-4 subunits.
Recombinantly-expressed BMP-4 subunits may include protein species
having heterogeneous amino termini. A mature subunit of
human BMP-4 is characterized by an amino acid sequence
comprising amino acids #293 (Ser) - #408 (Arg) of Figure
2. Other amino termini of BMP-4 may be selected from the
sequence of Figure 2. Modified versions of BMP-4,
including proteins further truncated at the amino or
carboxy termini, may also be constructed by resort to
conventional mutagenic techniques.

As disclosed in above-incorporated patent
application SN 721,847, BMP-4 may be produced by
culturing a cell transformed with a DNA sequence
comprising the nucleotide coding sequence from nucleotide
#403 to nucleotide #1626 in Figure 2 and recovering and
purifying from the culture medium a protein containing
the amino acid sequence from amino acid #293 to #408 as
shown in Figure 2, substantially free from other
proteinaceous materials with which it is co-produced.
BMP-4 proteins are capable of inducing the formation of
bone. BMP-4 proteins are capable of inducing formation
of cartilage. BMP-4 proteins are further characterized
by the ability to demonstrate cartilage and/or bone
formation activity in the rat bone formation assay.

Human BMP-7 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
3. Human BMP-7 proteins are further characterized as
disulfide-linked dimers and homodimers of mature BMP-7
subunits. Recombinantly-expressed BMP-7 subunits include
protein species having heterogeneous amino termini. One
BMP-7 subunit is characterized by comprising amino acid
#293 (Ser) - #431 (His) of Figure 3 (SEQ ID NOs: 5 and
6). This subunit is the most abundantly formed protein
produced by recombinant expression of the BMP-7 sequence.
Another BMP-7 subunit is characterized by comprising
amino acids #300 (Ser) - #431 (His) of Figure 3. Still
another BMP-7 subunit is characterized by comprising
amino acids #316 (Ala) - #431 (His) of Figure 3. Other
amino termini of BMP-7 may be selected from the sequence
of Figure 3. Similarly, modified versions, including
proteins further truncated at the amino or carboxy
termini, of BMP-7 may also be constructed by resort to
conventional mutagenic techniques.

As disclosed in above-incorporated patent
application SN 438,919, BMP-7 may be produced by
culturing a cell transformed with a DNA sequence
comprising the nucleotide coding sequence from nucleotide
#97 to nucleotide #1389 in Figure 3 and recovering and
purifying from the culture medium a protein containing
the amino acid sequence from amino acid #293 to #431 as
shown in Figure 3, substantially free from other
proteinaceous or contaminating materials with which it is
co-produced. These proteins are capable of stimulating,
promoting, or otherwise inducing cartilage and/or bone
formation.

Human BMP-6 is characterized by containing 
substantially the entire sequence, or fragments, of the 
amino acid sequence and DNA sequence disclosed in Figure 
4. Human BMP-6 proteins are further characterized as 
disulfide-linked dimers of mature BMP-6 subunits.
Recombinantly-expressed BMP-6 subunits may include 
protein species having heterogeneous amino termini. One 
BMP-6 subunit is characterized by comprising amino acid 
#375 (Ser) - #513 (His) of Figure 4 (SEQ ID NOs: 7 and 
8). Other amino termini of BMP-6 may be selected from 
the sequence of Figure 4. Modified versions, including 
proteins further truncated at the amino or carboxy 
termini, of BMP-6 may also be constructed by resort to 
conventional mutagenic techniques. 

As described in detail in United States Patent 
Application SN 490,033, incorporated by reference herein, 
human BMP-6 may be produced by culturing a cell 
transformed with a DNA sequence comprising the nucleotide 
coding sequence from nucleotide #160 to #1698 in Figure 4 
and recovering and purifying from the culture medium a 
protein comprising amino acid #375 to #513 of Figure 4, 
substantially free from other proteinaceous materials or 
other contaminating materials with which it is co- 
produced. Human BMP-6 may be further characterized by 
the ability to demonstrate cartilage and/or bone
formation activity in the rat bone formation assay. 

Human BMP-5 is characterized by containing 
substantially the entire sequence, or fragments, of the 
amino acid sequence and DNA sequence disclosed in Figure 
5 (SEQ ID NOs: 9 and 10) . Human BMP-5 proteins are 
further characterized as disulfide-linked dimers of
mature BMP-5 subunits. Recombinantly-expressed BMP-5
subunits may include protein species having heterogeneous 
amino termini. One BMP-5 subunit is characterized by 
comprising amino acid #329 (Ser) - #454 (His) of Figure 
5. Other amino termini of BMP-5 may be selected from the 
sequence of Figure 5. Modified versions, including 
proteins further truncated at the amino or carboxy 
termini, of BMP-5 may also be constructed by resort to 
conventional mutagenic techniques. 

As described in detail in United States Patent 
Application SN 588,227, incorporated by reference herein, 
human BMP-5 may be produced by culturing a cell 
transformed with a DNA sequence comprising the nucleotide 
coding sequence from nucleotide #701 to #2060 in Figure 5 
and recovering and purifying from the culture medium a 
protein comprising amino acid #329 to #454 of Figure 5, 
substantially free from other proteinaceous materials or 
other contaminating materials with which it is co- 
produced. Human BMP-5 may be further characterized by 
the ability to demonstrate cartilage and/or bone
formation activity in the rat bone formation assay
described in the above-referenced application.

Human BMP-8 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
6. Human BMP-8 proteins may be further characterized as
disulfide-linked dimers of mature BMP-8 subunits.
Recombinantly-expressed BMP-8 subunits may include
protein species having heterogeneous amino termini. A
BMP-8 sequence or subunit sequence comprises amino acid
#143 (Ala) - #281 (His) of Figure 6 (SEQ ID NOs: 11 and
12). Other amino termini of BMP-8 may be selected from
the sequence of Figure 6. Modified versions, including
proteins further truncated at the amino or carboxy
termini, of BMP-8 may also be constructed by resort to
conventional mutagenic techniques.

As described generally in United States Patent
Application SN 525,357, incorporated by reference herein,
and as further described herein, human BMP-8 may be
produced by culturing a cell transformed with a DNA
sequence comprising the nucleotide coding sequence from
nucleotide #1 to #850 in Figure 6 and recovering and
purifying from the culture medium a protein comprising
amino acid #143 to #281 of Figure 6, or similar amino
acid sequences with heterogeneous N-termini, substantially
free from other proteinaceous materials or other
contaminating materials with which it is co-produced.
This BMP-8 may also be produced in E. coli by inserting
into a vector the sequence encoding amino acid #143 to
#281 of Figure 6 with a Met inserted before amino acid
#142. Human BMP-8 may be further characterized by the
ability to demonstrate cartilage and/or bone formation
activity in the rat bone formation assay.
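
The residue ranges quoted above for the secreted subunits can be turned into subunit lengths by simple inclusive arithmetic. The following Python sketch is offered only as an illustration; the dictionary name `SUBUNIT_RANGES` and the function `subunit_length` are hypothetical, the ranges are copied from the text (the most abundant or representative subunit of each BMP), and the lengths follow from the figure numbering alone, not from any measurement.

```python
# Residue ranges (first, last) as quoted in the text for one subunit
# of each human BMP; numbering refers to Figures 1 through 6.
SUBUNIT_RANGES = {
    "BMP-2": (283, 396),  # Gln283 - Arg396
    "BMP-4": (293, 408),  # Ser293 - Arg408
    "BMP-7": (293, 431),  # Ser293 - His431
    "BMP-6": (375, 513),  # Ser375 - His513
    "BMP-5": (329, 454),  # Ser329 - His454
    "BMP-8": (143, 281),  # Ala143 - His281
}

def subunit_length(first, last):
    """Number of residues in an inclusive range first..last."""
    if last < first:
        raise ValueError("last residue must not precede first residue")
    return last - first + 1

lengths = {name: subunit_length(*rng) for name, rng in SUBUNIT_RANGES.items()}

for name, n in lengths.items():
    print(f"{name}: {n} residues")
```

On this numbering the BMP-2 subunit is 114 residues long, while the quoted BMP-6, BMP-7 and BMP-8 subunits are each 139 residues long.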

Each above described BMP protein in its native,
non-reduced dimeric form may be further characterized by
an apparent molecular weight on a 12% Laemmli gel ranging
from approximately 28kD to approximately 40kD.

Analogs or modified versions of the DNA and amino acid
sequences described herein which provide proteins or
active fragments displaying bone stimulating or repairing
activity in the rat bone formation assay described below
in Example 9 are also classified as suitable BMPs for use
in this invention, further provided that the proteins or
fragments contain one or more Cys residues for
participation in disulfide linkages. Useful
modifications of these sequences may be made by one of
skill in the art with resort to known recombinant genetic
engineering techniques. Production of these BMP
sequences in mammalian cells produces homodimers,
generally mixtures of homodimers having heterologous N
termini. Production of these BMP sequences in E. coli
produces monomeric protein species.

Thus, according to this invention one
recombinant heterodimer of the present invention
comprises the association of a human BMP-2, including,
e.g., a monomeric strand from a mature BMP-2 subunit as
described above or an active fragment thereof, bound
through one or up to seven covalent, disulfide linkages
to a human BMP-5 including, e.g., a monomeric strand from
a mature BMP-5 subunit as described above or an active
fragment thereof. Another recombinant heterodimer of the
present invention comprises the association of a human
BMP-2, as described above, bound through one or up to
seven covalent, disulfide linkages to a human BMP-6,
including, e.g., a monomeric strand from a BMP-6 subunit
as described above or an active fragment thereof.
Another recombinant heterodimer of the present invention
comprises the association of a human BMP-2, as described
above, bound through one or up to seven covalent,
disulfide linkages to a human BMP-7, including, e.g., a
monomeric strand of a BMP-7 subunit as described above or
an active fragment thereof. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-2, as described above, bound
through one or up to seven covalent, disulfide linkages
to a human BMP-8, including, e.g., a monomeric strand of
a BMP-8 subunit as described above or an active fragment
thereof.

Still another recombinant heterodimer of the
present invention comprises the association of a human
BMP-4, including, e.g., a monomeric strand of a BMP-4
subunit as described above or an active fragment thereof,
bound through one or up to seven covalent, disulfide
linkages to a human BMP-5, as described above. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-4, as described
above, bound through one or more covalent, disulfide
linkages to a human BMP-6, as described above. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-4, as described
above, bound through one or more covalent, disulfide
linkages to a human BMP-7, as described above. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-4, as described
above, bound through one or more covalent, disulfide
linkages to a human BMP-8, as described above.

A further recombinant heterodimer of the
present invention comprises the association of a human
BMP-2, including, e.g., a monomeric strand from a mature
BMP-2 subunit as described above or an active fragment
thereof, bound through at least one disulfide linkage to
a human BMP-3 including, e.g., a monomeric strand from a
mature BMP-3 subunit as described above or an active
fragment thereof. Another recombinant heterodimer of the
present invention comprises the association of a human
BMP-2, as described above, bound through at least one
disulfide linkage to a human BMP-4, including, e.g., a
monomeric strand from a BMP-4 subunit as described above
or an active fragment thereof. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-5, as described above, bound
through at least one disulfide linkage to a human BMP-6,
including, e.g., a monomeric strand of a BMP-6 subunit as
described above or an active fragment thereof. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-5, as described
above, bound through at least one disulfide linkage to a
human BMP-7, including, e.g., a monomeric strand of a
BMP-7 subunit as described above or an active fragment
thereof. In addition, human BMP-5 may be associated with
human BMP-8, bound through at least one disulfide linkage
to a human BMP-8 subunit or active fragment thereof.

Still another recombinant heterodimer of the
present invention comprises the association of a human
BMP-6, including, e.g., a monomeric strand of a BMP-6
subunit as described above or an active fragment thereof,
bound through at least one disulfide linkage to a human
BMP-7, as described above. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-6, as described above, bound
through one or more covalent, disulfide linkages to a
human BMP-8, as described above. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-7, as described above, bound
through one or more covalent, disulfide linkages to a
human BMP-8, as described above.

The disulfide linkages formed between the 
monomeric strands of the BMPs may occur between one Cys 
on each strand. Disulfide linkages may form between two 
Cys on each BMP. Disulfide linkages may form between 
three Cys on each BMP. Disulfide linkages may form 
between four Cys on each BMP. Disulfide linkages may 
form between five Cys on each BMP. Disulfide linkages 
may form between six Cys on each BMP. Disulfide linkages 
may form between seven Cys on each BMP. These disulfide 
linkages may form between adjacent Cys on each BMP or 
between only selected Cys interspersed within the 
respective protein sequence. Various heterodimers having
the same BMP component strands may form with different
numbers of disulfide linkages. Various heterodimers
having the same BMP component strands may form with
disulfide bonds at different Cys locations. Different
heterodimers encompassed by this invention having the
same BMP components may differ based upon their
recombinant production in mammalian cells, bacterial
cells, insect or yeast cells.

These recombinant heterodimers may be
characterized by increased alkaline phosphatase activity
in the W20 mouse stromal cell line bioassay (Example 8)
compared to the individual BMP homodimers, one strand of
which forms each heterodimer. Further, these
heterodimers are characterized by greater activity in the
W20 bioassay than is provided by simple mixtures of the
individual BMP dimers. Preliminary characterization of
heterodimers measured on the W20 bioassay has
demonstrated that heterodimers of BMP-2 with BMP-5, BMP-6
or BMP-7 are very active. Similarly, heterodimers of
BMP-4 with BMP-5, BMP-6 or BMP-7 are strongly active in
the W20 bioassay.

Heterodimers of this invention may also be
characterized by activity in bone growth and stimulation
assays. For example, a heterodimer of this invention is
also active in the rat bone formation assay described
below in Example 9. The heterodimers are also active in
the osteocalcin bioassay described in Example 8. Other
characteristics of a heterodimer of this invention
include co-precipitation with anti-BMP antibodies to the
two different constituent BMPs, as well as characteristic
results on Western blots, high pressure liquid
chromatography (HPLC) and on two-dimensional gels, with
and without reducing conditions.

One embodiment of the method of the present
invention for producing recombinant BMP heterodimers
involves culturing a suitable cell line, which has been
co-transfected with a DNA sequence coding for expression
of a first BMP or fragment thereof and a DNA sequence
coding for expression of a second BMP or fragment
thereof, under the control of known regulatory sequences.
The transformed host cells are cultured and the
heterodimeric protein recovered and purified from the
culture medium.

In another embodiment of this method, which is
the presently preferred method of expression of the
heterodimers of this invention, a single host cell, e.g.,
a CHO DUKX cell, is co-transfected with a first DNA
molecule containing a DNA sequence encoding one BMP and a
second DNA molecule containing a DNA sequence encoding a
second selected BMP. One or both plasmids contain a
selectable marker that can be used to establish stable
cell lines expressing the BMPs. These separate plasmids
containing distinct BMP genes on separate transcription
units are mixed and transfected into the CHO cells using
conventional protocols. A ratio of plasmids that gives
maximal expression of activity in the W20 assay,
generally 1:1, is determined.

For example, as described in detail in Example
3, equal ratios of a plasmid containing the first BMP and
a dihydrofolate reductase (DHFR) marker gene and another
plasmid containing a second BMP and a DHFR marker gene
can be co-introduced into DHFR-deficient CHO cells,
DUKX-BII, by calcium phosphate coprecipitation and
transfection, electroporation, microinjection, protoplast
fusion or lipofection. Individual DHFR expressing
transformants are selected for growth in alpha media with
dialyzed fetal calf serum by conventional means. DHFR+
cells containing increased gene copies can be selected
for propagation in increasing concentrations of
methotrexate (MTX) (e.g., sequential steps in 0.02, 0.1,
0.5 and 2.0 uM MTX) according to the procedures of
Kaufman and Sharp, J. Mol. Biol., 159:601-629 (1982); and
Kaufman et al, Mol. Cell Biol., 5:1750 (1983).
Expression of the heterodimer or at least one BMP linked
to DHFR should increase with increasing levels of MTX
resistance. Cells that stably express either or both
BMP/DHFR genes will survive. However, at a high
frequency, cell lines stably incorporate and express both
plasmids that were present during the initial
transfection. The conditioned medium is thereafter
harvested and the heterodimer isolated by conventional
methods and assayed for activity. This approach can be
employed with DHFR-deficient cells.
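
The stepwise methotrexate selection described above can be summarized
numerically. The following is a minimal sketch in Python that uses only
the four example concentrations given in the text; the constant and
function names are hypothetical and serve purely as illustration, not as
part of the disclosed procedure.

```python
# Example MTX concentrations (uM) for the sequential selection steps,
# taken from the text above. All names here are illustrative only.
MTX_STEPS_UM = [0.02, 0.1, 0.5, 2.0]


def fold_increases(steps):
    """Return the fold increase in concentration between successive steps."""
    return [round(later / earlier, 2) for earlier, later in zip(steps, steps[1:])]


if __name__ == "__main__":
    for conc, fold in zip(MTX_STEPS_UM[1:], fold_increases(MTX_STEPS_UM)):
        print(f"step to {conc} uM MTX: {fold}-fold increase")
```

With these example values, each selection step raises the drug
concentration four- to five-fold.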

As an alternative embodiment of this method, a
DNA molecule containing one selected BMP gene may be
transfected into a stable cell line which already
expresses another selected BMP gene. For example, as
described in detail in Example 3 below, a stable CHO cell
line expressing BMP-7 with the DHFR marker (designated
7MB9) [Genetics Institute, Inc] is transfected with a
plasmid containing BMP-2 and a second selectable marker
gene, e.g., neomycin resistance (Neo). After
transfection, the cell is cultured and suitable cells
selected by treatment with MTX and the antibiotic, G-418.
Surviving cells are then screened for the expression of
the heterodimer. This expression system has the
advantage of permitting a single step selection.

WO 93/09229

Alternative dual selection strategies using
different cell lines or different markers can also be
used. For example, the use of an adenosine deaminase
(ADA) marker to amplify the second BMP gene in a stable
CHO cell line expressing a different BMP with the DHFR
marker may be preferable, since the level of expression
can be increased using deoxycoformycin (DCF)-mediated
gene amplification. (See the ADA containing plasmid
described in Example 1). Alternatively, any BMP cell
line made by first using this marker can then be the
recipient of a second BMP expression vector containing a
distinct marker and selected for dual resistance and BMP
coexpression.

Still another embodiment of a method of
expressing the heterodimers of this invention includes
transfecting the host cell with a single DNA molecule
encoding multiple genes for expression either on a single
transcription unit or on separate transcription units.
Multicistronic expression involves multiple polypeptides
encoded within a single transcript, which can be
efficiently translated from vectors utilizing a leader
sequence, e.g., from the EMC virus, from poliovirus, or
from other conventional sources of leader sequences. Two
BMP genes and a selectable marker can be expressed within
a single transcription unit. For example, vectors
containing the configuration BMPx-EMC-BMPy-DHFR or
BMPx-EMC-BMPy-EMC-DHFR can be transfected into CHO cells
and selected and amplified using the DHFR marker. A plasmid
may be constructed which contains DNA sequences encoding
two different BMPs, one or more marker genes and a
suitable leader or regulatory sequence on a single
transcription unit.

Similarly, host cells may be transfected with a
single plasmid which contains separate transcription
units for each BMP. A selectable marker, e.g., DHFR, can
be contained on another transcription unit, or
alternatively as the second cistron on one or both of the
BMP genes. These plasmids may be transfected into a
selected host cell for expression of the heterodimer, and
the heterodimer isolated from the cells or culture medium
as described above.

Another embodiment of this expression method
involves cell fusion. Two stable cell lines which
express selected BMPs, such as a cell line expressing
BMP-2 (e.g., 2EG5) and a cell line expressing BMP-7
(e.g., 7MB9), developed using the DHFR/MTX gene
amplification system and expressing BMP at high levels,
as described in Example 1 and in the above incorporated
U.S. applications, can be transfected with one of several
dominant marker genes (e.g., neo^r, hygromycin^r, GPT).
After sufficient time in coculture (approximately one
day), one resultant cell line expressing one BMP and a
dominant marker can be fused with a cell line expressing
a different BMP and preferably a different marker using a
fusigenic reagent, such as polyethylene glycol, Sendai
virus or other known agent.

The resulting cell hybrids expressing both
dominant markers and DHFR can be selected using the
appropriate culture conditions, and screened for
coexpression of the BMPs or their fragments. The
selected hybrid cell contains sequences encoding both
selected BMPs, and the heterodimer is formed in the cell
and then secreted. The heterodimer is obtained from the
conditioned medium and isolated and purified therefrom by
conventional methods (see, e.g., Example 4). The
resulting heterodimer may be characterized by methods
described herein.

Cell lines generated from the approaches
described above can be used to produce co-expressed,
heterodimeric BMP polypeptides. The heterodimeric
proteins are isolated from the cell medium in a form
substantially free from other proteins with which they
are co-produced as well as from other contaminants found
in the host cells by conventional purification
techniques. The presently preferred method of production
is co-transfection of different vectors into CHO cells
and methotrexate-mediated gene amplification. Stable
cell lines may be used to generate conditioned media
containing recombinant BMP that can be purified and
assayed for in vitro and in vivo activities. For
example, the resulting heterodimer-producing cell lines
obtained by any of the methods described herein may be
screened for activity by the assays described in Examples
8 and 9, RNA expression, and protein expression by sodium
dodecyl sulfate polyacrylamide gel electrophoresis
(SDS-PAGE).

The above-described methods of co-expression of
the heterodimers of this invention utilize suitable host
cells or cell lines. Suitable cells preferably include
mammalian cells, such as Chinese hamster ovary cells
(CHO). The selection of suitable mammalian host cells
and methods for transformation, culture, amplification,
screening and product production and purification are
known in the art. See, e.g., Gething and Sambrook,
Nature, 293:620-625 (1981), or alternatively, Kaufman et
al, Mol. Cell. Biol., 5(7):1750-1759 (1985) or Howley et
al, U. S. Patent 4,419,446. Other suitable mammalian
cell lines are the CV-1 cell line, BHK cell lines and the
293 cell line. The monkey COS-1 cell line is presently
believed to be inefficient in BMP heterodimer production.

Many strains of yeast cells known to those
skilled in the art may also be available as host cells
for expression of the polypeptides of the present
invention, e.g., Saccharomyces cerevisiae. Additionally,
where desired, insect cells may be utilized as host cells
in the method of the present invention. See, e.g.,
Miller et al, Genetic Engineering, 8:277-298 (Plenum
Press 1986) and references cited therein.

Another method for producing a biologically
active heterodimeric protein of this invention may be
employed where the host cells are microbial, preferably
bacterial cells, in particular E. coli. For example, the
various strains of E. coli (e.g., HB101, MC1061) are
well-known as host cells in the field of biotechnology.
Various strains of B. subtilis, Pseudomonas, other
bacilli and the like may also be employed in this method.

This method, which may be employed to produce
monomers and dimers (both homodimers and heterodimers), is
described in European Patent Application No. 433,225,
incorporated herein by reference. Briefly, this process
involves culturing a microbial host comprising a
nucleotide sequence encoding the desired BMP protein
linked in the proper reading frame to an expression
control sequence which permits expression of the protein
and recovering the monomeric, soluble protein. Where the
protein is insoluble in the host cells, the
water-insoluble protein fraction is isolated from the host
cells and the protein is solubilized. After
chromatographic purification, the solubilized protein is
subjected to selected conditions to obtain the
biologically active dimeric configuration of the protein.
This process, which may be employed to produce the
heterodimers of this invention, is described specifically
in Example 7, for the production of a BMP-2 homodimer.

Another aspect of the present invention 
provides DNA molecules or plasmid vectors for use in 
expression of these recombinant heterodimers. These 
plasmid vectors may be constructed by resort to known 
methods and available components known to those of skill 
in the art. In general, to generate a vector useful in 
the methods of this invention, the DNA encoding the 
desired BMP protein is transferred into one or more 
appropriate expression vectors suitable for the selected 
host cell. 

It is presently contemplated that any 
expression vector suitable for efficient expression in 
mammalian cells may be employed to produce the 
recombinant heterodimers of this invention in mammalian 
host cells. Preferably the vectors contain the selected 
BMP DNA sequences described above and in the Figures, 
which encode selected BMP components of the heterodimer. 
Alternatively, vectors incorporating modified sequences 
as described in the above-referenced patent applications 
are also embodiments of the present invention and useful 
in the production of the vectors. 

In addition to the specific vectors described
in Example 1, one skilled in the art can construct
mammalian expression vectors by employing the sequence of
Figures 1-6 or other DNA sequences containing the coding
sequences of Figures 1-6 (SEQ ID NOs: 1, 3, 5, 7, 9 and
11), or other modified sequences and known vectors, such
as pCD [Okayama et al, Mol. Cell Biol., 2:161-170 (1982)]
and pJL3, pJL4 [Gough et al, EMBO J., 1:645-653 (1985)].
The BMP DNA sequences can be modified by removing the
non-coding nucleotides on the 5' and 3' ends of the
coding region. The deleted non-coding nucleotides may or
may not be replaced by other sequences known to be
beneficial for expression. The transformation of these
vectors into appropriate host cells as described above
can produce desired heterodimers.

One skilled in the art could manipulate the
sequences of Figures 1-6 by eliminating or replacing the
mammalian regulatory sequences flanking the coding
sequence with, e.g., yeast or insect regulatory sequences,
to create vectors for intracellular or extracellular
expression by yeast or insect cells. [See, e.g.,
procedures described in published European Patent
Application 155,476 for expression in insect cells; and
procedures described in published PCT application
WO86/00639 and European Patent Application EPA 123,289
for expression in yeast cells].

Similarly, bacterial sequences and preference
codons may replace sequences in the described and
exemplified mammalian vectors to create suitable
expression systems for use in the production of BMP
monomers in the method described above. For example, the
coding sequences could be further manipulated (e.g.,
ligated to other known linkers or modified by deleting
non-coding sequences therefrom or altering nucleotides
therein by other known techniques). The modified BMP
coding sequences could then be inserted into a known
bacterial vector using procedures such as described in T.
Taniguchi et al, Proc. Natl. Acad. Sci. USA, 77:5230-5233
(1980). The exemplary bacterial vector could then be
transformed into bacterial host cells and BMP
heterodimers expressed thereby. An exemplary vector for
microbial, e.g., bacterial, expression is described below
in Example 7.

Other vectors useful in the methods of this
invention may contain multiple genes in a single
transcription unit. For example, a proposed plasmid
p7E2D contains the BMP-7 gene followed by the EMC leader
sequence, followed by the BMP-2 gene, followed by the
DHFR marker gene. Another example is plasmid p7E2ED
which contains the BMP-7 gene, the EMC leader, the BMP-2
gene, another EMC leader sequence and the DHFR marker
gene. Alternatively, the vector may contain more than
one transcription unit. As one example, the plasmid
p2ED7ED contains a transcription unit for BMP-2 and a
separate transcription unit for BMP-7, i.e.,
BMP-2-EMC-DHFR and BMP-7-EMC-DHFR. Alternatively, each
transcription unit on the plasmid may contain a different
marker gene. For example, plasmid p2EN7ED contains
BMP-2-EMC-Neo and BMP-7-EMC-DHFR.

Additionally, the vectors also contain
appropriate expression control sequences which are
capable of directing the replication and expression of
the BMP in the selected host cells. Useful regulatory
sequences for such vectors are known to one of skill in
the art and may be selected depending upon the selected
host cells. Such selection is routine and does not form
part of the present invention. Similarly, the vectors
may contain one or more selection markers, such as the
antibiotic resistance gene, Neo, or selectable markers
such as DHFR and ADA. The presently preferred marker
gene is DHFR. These marker genes may also be selected by
one of skill in the art.
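
The multi-gene plasmid layouts named above can be written out as simple
data to make the difference between single and multiple transcription
units explicit. The sketch below is an illustrative assumption about how
one might record them in Python: each plasmid is a list of transcription
units, and each unit is an ordered list of elements as described in the
text. The dictionary, set and function names are hypothetical.

```python
# Each plasmid is a list of transcription units; each unit is an ordered
# list of elements. "EMC" denotes the EMC virus leader sequence. The data
# layout and all names below are illustrative assumptions.
PLASMIDS = {
    "p7E2D": [["BMP-7", "EMC", "BMP-2", "DHFR"]],
    "p7E2ED": [["BMP-7", "EMC", "BMP-2", "EMC", "DHFR"]],
    "p2ED7ED": [["BMP-2", "EMC", "DHFR"], ["BMP-7", "EMC", "DHFR"]],
    "p2EN7ED": [["BMP-2", "EMC", "Neo"], ["BMP-7", "EMC", "DHFR"]],
}

MARKER_GENES = {"DHFR", "Neo", "ADA"}


def summarize(name):
    """Count transcription units and list the BMP and marker genes present."""
    units = PLASMIDS[name]
    elements = [element for unit in units for element in unit]
    return {
        "transcription_units": len(units),
        "bmp_genes": sorted({e for e in elements if e.startswith("BMP-")}),
        "markers": sorted({e for e in elements if e in MARKER_GENES}),
    }


if __name__ == "__main__":
    for plasmid_name in PLASMIDS:
        print(plasmid_name, summarize(plasmid_name))
```

Under this representation, p7E2D and p7E2ED each carry both BMP genes on
one transcription unit, while p2ED7ED and p2EN7ED carry them on two.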

Once they are expressed by one of the methods
described above, the heterodimers of this invention may
be identified and characterized by application of a
variety of assays and procedures. A co-precipitation
(immunoprecipitation) assay may be performed with
antibodies to each of the BMPs forming the heterodimer.
Generally, antibodies for this use may be developed by
conventional means, e.g., using the selected BMP,
fragments thereof, or synthetic BMP peptides as antigen.
Antibodies employed in assays are generally polyclonal
antibodies made from individual BMP peptides or proteins
injected into rabbits according to classical techniques.
This assay is performed conventionally, and permits the
identification of the heterodimer, which is precipitated
by antibodies to both BMP components of the heterodimer.
In contrast, only one of the two antibodies causes
precipitation of any homodimeric form which may be
produced in the process of producing the heterodimer.
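
The interpretation rule stated above can be expressed as a small truth
table. The following Python sketch assumes a single dimer species tested
separately with an antibody to each BMP; the function name and return
strings are hypothetical, and a real sample containing a mixture of
dimers would need a more careful analysis than this.

```python
def classify_dimer(precipitated_by_anti_a, precipitated_by_anti_b):
    """Interpret a co-precipitation result for a single dimer species.

    Assumption for illustration: the sample contains one dimer species,
    and each flag records whether the antibody to one BMP component
    precipitated it.
    """
    if precipitated_by_anti_a and precipitated_by_anti_b:
        return "heterodimer"
    if precipitated_by_anti_a or precipitated_by_anti_b:
        return "homodimer"
    return "neither BMP detected"


if __name__ == "__main__":
    print(classify_dimer(True, True))
    print(classify_dimer(True, False))
```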

Another characterizing assay is a Western 
assay, employing a precipitating antibody, a probing 
antibody and a detecting antibody. This assay may also 
be performed conventionally, by using an antibody to one 
of the BMPs to precipitate the dimers, which are run on 
reducing SDS-PAGE for Western analysis. An antibody to 
the second BMP is used to probe the precipitates on the 
Western gel for the heterodimer. A detecting antibody, 
such as a goat anti-rabbit antibody labelled with
horseradish peroxidase (HRP), is then applied, which will
reveal the presence of one of the component subunits of 
the heterodimer. 

Finally, the specific activity of the
heterodimer may be quantitated as described in detail in
Example 6. Briefly, the amount of each BMP is
quantitated using Western blot analysis or pulse
labelling and SDS-PAGE analysis in samples of each BMP
homodimer and the heterodimer. The W20 activity is also
determined as described specifically in Example 8. The
relative specific activities may be calculated by the
formula: W20 alkaline phosphatase activity/amount of BMP
on Western blot or by fluorography. As one example, this
formula has been determined for the BMP-2/7 heterodimer,
demonstrating that the heterodimer has an estimated 5 to
50 fold higher specific activity than the BMP-2
homodimer.
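
The formula given above is a simple ratio, and comparing two
preparations is a ratio of ratios. The sketch below illustrates this in
Python. The function names are hypothetical, the units are arbitrary,
and the numbers in the usage lines are invented solely to show the
arithmetic; they are not measurements from this disclosure.

```python
def specific_activity(w20_alk_phos_activity, bmp_amount):
    """W20 alkaline phosphatase activity divided by the amount of BMP."""
    if bmp_amount <= 0:
        raise ValueError("amount of BMP must be positive")
    return w20_alk_phos_activity / bmp_amount


def relative_specific_activity(sample, reference):
    """Ratio of specific activities; each argument is (activity, amount)."""
    return specific_activity(*sample) / specific_activity(*reference)


if __name__ == "__main__":
    # Invented illustrative values in arbitrary units.
    heterodimer = (100.0, 2.0)
    homodimer = (10.0, 4.0)
    print(relative_specific_activity(heterodimer, homodimer))
```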

The heterodimers of the present invention may
have a variety of therapeutic and pharmaceutical uses,
e.g., in compositions for wound healing, tissue repair,
and in similar compositions which have been indicated for
use of the individual BMPs. Increased potency of the
heterodimers over the individual BMPs may permit lower
dosages of the compositions in which they are contained
to be administered to a patient in comparison to dosages
of compositions containing only a single BMP. A
heterodimeric protein of the present invention, which
induces cartilage and/or bone growth in circumstances
where bone is not normally formed, has application in the
healing of bone fractures and cartilage defects in humans
and other animals. Such a preparation employing a
heterodimeric protein of the invention may have
prophylactic use in closed as well as open fracture
reduction and also in the improved fixation of artificial
joints. De novo bone formation induced by an osteogenic
agent contributes to the repair of congenital, trauma
induced, or oncologic resection induced craniofacial
defects, and also is useful in cosmetic plastic surgery.

A heterodimeric protein of this invention may
be used in the treatment of periodontal disease, and in
other tooth repair processes. Such agents may provide an
environment to attract bone-forming cells, stimulate
growth of bone-forming cells or induce differentiation of
progenitors of bone-forming cells. Heterodimeric
polypeptides of the invention may also be useful in the
treatment of osteoporosis. A variety of osteogenic,
cartilage-inducing and bone inducing factors have been
described. See, e.g., European Patent Applications
148,155 and 169,016 for discussions thereof.

The proteins of the invention may also be used
in wound healing and related tissue repair. The types of
wounds include, but are not limited to, burns, incisions
and ulcers. (See, e.g., PCT Publication WO84/01106
incorporated by reference herein for discussion of wound
healing and related tissue repair).

Additionally, the proteins of the invention may 
increase neuronal survival and therefore be useful in 
transplantation and treatment of conditions exhibiting a 
decrease in neuronal survival. 

In view of the usefulness of the heterodimers, 
therefore, a further aspect of the invention is a 
therapeutic method and composition for repairing 
fractures and other conditions related to cartilage 
and/or bone defects or periodontal diseases. In 
addition, the invention comprises therapeutic methods and 
compositions for wound healing and tissue repair. Such 
compositions comprise a therapeutically effective amount 
of a heterodimeric protein of the invention in admixture
with a pharmaceutically acceptable vehicle, carrier or
matrix. The preparation and formulation of such
physiologically acceptable protein compositions, having 
due regard to pH, isotonicity, stability and the like, is 
within the skill of the art. 

It is expected that the proteins of the
invention may act in concert with other related proteins
and growth factors. Therapeutic methods and compositions
of the invention therefore comprise a therapeutic amount
of a heterodimeric protein of the invention with a
therapeutic amount of at least one of the other BMP
proteins disclosed in co-owned and concurrently filed
U. S. applications described above. Such combinations may
comprise separate molecules of the BMP proteins or other
heteromolecules of the present invention.

In further compositions, heterodimeric proteins
of the invention may be combined with other agents
beneficial to the treatment of the bone and/or cartilage
defect, wound, or tissue in question. These agents
include various growth factors such as epidermal growth
factor (EGF), platelet derived growth factor (PDGF),
transforming growth factors (TGF-α and TGF-β), and
insulin-like growth factor (IGF).

The therapeutic compositions are also presently
valuable for veterinary applications due to the lack of
species specificity in BMP proteins. Particularly
domestic animals and thoroughbred horses, in addition to
humans, are desired patients for such treatment with
heterodimeric proteins of the present invention.

The therapeutic method includes administering
the composition topically, systemically, or locally as
an implant or device. When administered, the therapeutic
composition for use in this invention is, of course, in a
pyrogen-free, physiologically acceptable form. Further,
the composition may desirably be encapsulated or injected
in a viscous form for delivery to the site of bone,
cartilage or tissue damage. Topical administration may
be suitable for wound healing and tissue repair.
Therapeutically useful agents other than the
heterodimeric proteins of the invention, which may also
optionally be included in the composition as described
above, may alternatively or additionally be administered
simultaneously or sequentially with the heterodimeric BMP
composition in the methods of the invention. Preferably
for bone and/or cartilage formation, the composition
would include a matrix capable of delivering the
heterodimeric protein-containing composition to the site
of bone and/or cartilage damage, providing a structure
for the developing bone and cartilage and optimally
capable of being resorbed into the body. Such matrices
may be formed of materials presently in use for other
implanted medical applications.

The choice of matrix material is based on
biocompatibility, biodegradability, mechanical
properties, cosmetic appearance and interface properties.
The particular application of the heterodimeric BMP
compositions will define the appropriate formulation.
Potential matrices for the compositions may be
biodegradable and chemically defined calcium sulfate,
tricalciumphosphate, hydroxyapatite, polylactic acid,
polyglycolic acid and polyanhydrides. Other potential
materials are biodegradable and biologically well
defined, such as bone or dermal collagen. Further
matrices are comprised of pure proteins or extracellular
matrix components. Other potential matrices are
nonbiodegradable and chemically defined, such as sintered
hydroxyapatite, bioglass, aluminates, or other ceramics.
Matrices may be comprised of combinations of any of the
above mentioned types of material, such as polylactic
acid and hydroxyapatite or collagen and
tricalciumphosphate. The bioceramics may be altered in
composition, such as in calcium-aluminate-phosphate and
processing to alter pore size, particle size, particle
shape, and biodegradability.

Presently preferred is a 50:50 (mole weight)
copolymer of lactic acid and glycolic acid in the form of
porous particles having diameters ranging from 150 to 800
microns. In some applications, it will be useful to
utilize a sequestering agent, such as carboxymethyl
cellulose or autologous blood clot, to prevent the BMP
compositions from dissociating from the matrix.

The dosage regimen of a heterodimeric
protein-containing pharmaceutical composition will be determined
by the attending physician considering various factors
which modify the action of the heterodimeric proteins,
e.g., amount of bone weight desired to be formed, the site
of bone damage, the condition of the damaged bone, the
size of a wound, type of damaged tissue, the patient's
age, sex, and diet, the severity of any infection, time
of administration and other clinical factors. The dosage
may vary with the type of matrix used in the
reconstitution and the BMP proteins in the heterodimer
and any additional BMP or other proteins in the
pharmaceutical composition. For example, the addition of
other known growth factors, such as IGF I (insulin like
growth factor I), to the final composition, may also
affect the dosage. Progress can be monitored by periodic
assessment of bone growth and/or repair, for example,
X-rays, histomorphometric determinations and tetracycline
labeling.

The following examples are illustrative of the 
present invention and do not limit its scope. 

EXAMPLE 1 - BMP Vector Constructs and Cell Lines
A. BMP-2 Vectors

The mammalian expression vector pMT2 CXM
is a derivative of p91023(b) [Wong et al, Science,
228:810-815 (1985)] differing from the latter in that it
contains the ampicillin resistance gene (Amp) in place of
the tetracycline resistance gene (Tet) and further
contains a XhoI site for insertion of cDNA clones. The
functional elements of pMT2 CXM have been described [R.
J. Kaufman, Proc. Natl. Acad. Sci. USA, 81:689-693
(1985)] and include the adenovirus VA genes, the SV40
origin of replication including the 72 bp enhancer, the
adenovirus major late promoter including a 5' splice site
and the majority of the adenovirus tripartite leader
sequence present on adenovirus late mRNAs, a 3' splice
acceptor site, a DHFR insert, the SV40 early
polyadenylation site (SV40), and pBR322 sequences needed
for propagation in E. coli.

EcoRI digestion of pMT2-VWF, which has
been deposited with the American Type Culture Collection
(ATCC), Rockville, MD (USA) under accession number ATCC
67122, excises the cDNA insert present in pMT2-VWF,
yielding pMT2 in linear form. Plasmid pMT2 can be
ligated and used to transform E. coli HB101 or DH-5 to
ampicillin resistance. Plasmid pMT2 DNA can be prepared
by conventional methods.

Plasmid pMT2 CXM is then constructed using
loopout/in mutagenesis [Morinaga et al, Biotechnology,
84:636 (1984)]. This removes bases 1075 to 1145 relative
to the HindIII site near the SV40 origin of replication
and enhancer sequences of pMT2. In addition it inserts
the following sequence:
5' PO4-CATGGGCAGCTCGAG-3' (SEQ ID NO: 15)
at nucleotide 1145. This sequence contains the
recognition site for the restriction endonuclease XhoI.

A derivative of pMT2 CXM, termed plasmid pMT23,
contains recognition sites for the restriction
endonucleases PstI, EcoRI, SalI and XhoI.

Full length BMP-2 cDNA (Fig. 1) (SEQ ID NO: 1)
is released from the λGT10 vector by digestion with EcoRI
and subcloned into pSP65 [Promega Biotec, Madison,
Wisconsin; see, e.g., Melton et al, Nucl. Acids Res.,
12:7035-7056 (1984)] in both orientations yielding pBMP-2
#39-3 or pBMP-2 #39-4.

The majority of the untranslated regions of the
BMP-2 cDNA are removed in the following manner. The 5'
sequences are removed between the SalI site in the
adapter (present from the original cDNA cloning) and the
SalI site 7 base pairs upstream of the initiator ATG by
digestion of the pSP65 plasmid containing the BMP-2 cDNA
with SalI and religation. The 3' untranslated region is
removed by heteroduplex mutagenesis using the
oligonucleotide

5' GAGGGTTGTGGGTGTCGCTAGTGAGTCGACTACAGCAAAATT 3'
                     End   SalI
(SEQ ID NO: 16)

The sequence contains the terminal 3' coding region of
the BMP-2 cDNA, followed immediately by a recognition
site for SalI. The sequence introduces a SalI site
following the termination (TAG) codon.



The SalI fragment of this clone was subcloned
into the expression vector pMT23, yielding the vector
pMT23-BMP2ΔUT. Restriction enzyme sites flank the BMP-2
coding region in the sequence PstI-EcoRI-SalI-BMP-2
cDNA-SalI-EcoRI-XhoI.

The expression plasmid pED4 [Kaufman et al,
Nucl. Acids Res., 19:4485-4490 (1991)] was linearized by
digestion with EcoRI and treated with calf intestinal
phosphatase. The BMP-2 cDNA gene was excised from
pMT23-BMP2ΔUT by digestion with EcoRI and recovery of the 1.2
kb fragment by electrophoresis through a 1.0% low melt
agarose gel. The linearized pED4 vector and the EcoRI
BMP-2 fragment were ligated together, yielding the BMP-2
expression plasmid pBMP2Δ-EMC.

Another vector pBMP-2Δ-EN contains the same
sequences contained within the vector pBMP2Δ-EMC, except
the DHFR gene has been replaced by conventional means
with the neomycin resistance gene from the Tn5
transposable element.

B. BMP-4 Vectors

A BMP-4 cDNA sequence set forth in Figure
2 (SEQ ID NO: 3), in which the 3' untranslated region is
removed, is made via heteroduplex mutagenesis with the
mutagenic oligonucleotide:

5' GGATGTGGGTGCCGCTGACTCTAGAGTCGACGGAATTC 3'
                  End              EcoRI
(SEQ ID NO: 17)

This deletes all of the sequences 3' to the translation
terminator codon of the BMP-4 cDNA, juxtaposing this
terminator codon and the vector polylinker sequences.
This step is performed in an SP65 vector [Promega
Biotech] and may also be conveniently performed in
pMT2-derivatives containing the BMP-4 cDNA similar to the BMP2
vectors described above. The 5' untranslated region is
removed using the restriction endonuclease BsmI, which
cleaves within the eighth codon of BMP-4 cDNA.

Reconstruction of the first eight codons
is accomplished by ligation to oligonucleotides:

   EcoRI   Initiator       BsmI
5' AATTCACCATGATTCCTGGTAACCGAATGCT 3' (SEQ ID NO: 18)

and

3' GTGGTACTAAGGACCATTGGCTTAC 5' (SEQ ID NO: 19)

These oligonucleotides form a duplex which has a BsmI
complementary cohesive end capable of ligation to the
BsmI restricted BMP-4 cDNA, and it has an EcoRI
complementary cohesive end capable of ligation to the
EcoRI restricted vector pMT2. Thus the cDNA for BMP-4
with the 5' and 3' untranslated regions deleted, and
retaining the entire encoding sequence is contained
within an EcoRI restriction fragment of approximately 1.2
kb.
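
The pairing of the two oligonucleotides above can be checked
mechanically. The following Python sketch takes the two sequences as
printed (SEQ ID NO: 18 written 5' to 3', SEQ ID NO: 19 written 3' to
5'), finds where they base-pair, and reports the single-stranded ends
left on the top strand. The function and constant names are hypothetical
and offered only as an illustration.

```python
# Watson-Crick complement table for DNA.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOP_5_TO_3 = "AATTCACCATGATTCCTGGTAACCGAATGCT"  # SEQ ID NO: 18
BOTTOM_3_TO_5 = "GTGGTACTAAGGACCATTGGCTTAC"     # SEQ ID NO: 19, as written


def duplex_overhangs(top, bottom_3_to_5):
    """Locate where the bottom strand pairs with the top strand.

    Returns (unpaired 5' end of top, unpaired 3' end of top).
    """
    paired = len(bottom_3_to_5)
    for offset in range(len(top) - paired + 1):
        window = top[offset:offset + paired]
        if window.translate(COMPLEMENT) == bottom_3_to_5:
            return top[:offset], top[offset + paired:]
    raise ValueError("strands are not complementary at any offset")


if __name__ == "__main__":
    five_prime, three_prime = duplex_overhangs(TOP_5_TO_3, BOTTOM_3_TO_5)
    print("5' overhang:", five_prime)
    print("3' overhang:", three_prime)
```

For the sequences as printed, the duplex leaves a four-nucleotide 5'
AATT end, which matches the overhang produced by EcoRI, and a
two-nucleotide 3' end at the BsmI side, consistent with the cohesive
ends described in the text.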

The pMT2 CXM plasmid containing this BMP-4
sequence is designated pXMBMP-4ΔUT. It is digested with
EcoRI in order to release the BMP-4 cDNA containing
insert from the vector. This insert is subcloned into
the EcoRI site of the mammalian expression vector pED4,
resulting in pBMP4Δ-EMC.

C. BMP-5 Vectors

A BMP-5 cDNA sequence comprising the
nucleotide sequence from nucleotide #699 to #2070 of Fig.
5 (SEQ ID NO: 9) is specifically amplified as follows.
The oligonucleotides CGACCTGCAGCCACCATGCATCTGACTGTA (SEQ
ID NO: 20) and TGCCTGCAGTTTAATATTAGTGGCAGC (SEQ ID NO:
21) are utilized as primers to allow the amplification of
nucleotide sequence #699 to #2070 of Fig. 5 from the
BMP-5 insert of λ-ZAP clone U2-16 [ATCC #68109]. This
procedure introduces the nucleotide sequence
CGACCTGCAGCCACC (SEQ ID NO: 22) immediately preceding
nucleotide #699 and the nucleotide sequence CTGCAGGCA
immediately following nucleotide #2070. The addition of
these sequences results in the creation of PstI
restriction endonuclease recognition sites at both ends
of the amplified DNA fragment. The resulting amplified
DNA product of this procedure is digested with the
restriction endonuclease PstI and subcloned into the PstI
site of the pMT2 derivative pMT21 [Kaufman, Nucl. Acids
Res., 19:4485-4490 (1991)]. The resulting clone is
designated H5/5/pMT.
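
The relationship between the two primers and the flanking sequences they
introduce can be verified directly from the printed sequences. The
Python sketch below is illustrative only, and its names are
hypothetical. It checks that the forward primer begins with the 15
nucleotides said to precede nucleotide #699, that the reverse complement
of the reverse primer ends with the 9 nucleotides said to follow
nucleotide #2070, and that both carry the PstI recognition sequence
CTGCAG.

```python
# Watson-Crick complement table for DNA.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

PSTI_SITE = "CTGCAG"
FORWARD_PRIMER = "CGACCTGCAGCCACCATGCATCTGACTGTA"  # SEQ ID NO: 20
REVERSE_PRIMER = "TGCCTGCAGTTTAATATTAGTGGCAGC"     # SEQ ID NO: 21


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    print("upstream addition:  ", FORWARD_PRIMER[:15])
    print("downstream addition:", reverse_complement(REVERSE_PRIMER)[-9:])
    print("PstI site in forward primer:", PSTI_SITE in FORWARD_PRIMER)
    print("PstI site in reverse primer:", PSTI_SITE in REVERSE_PRIMER)
```

Because CTGCAG is its own reverse complement, the site reads the same on
either strand of the amplified product.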

The insert of H5/5/pMT is excised by PstI
digestion and subcloned into the plasmid vector pSP65
[Promega Biotech] at the PstI site, resulting in plasmid
BMP5/SP6. BMP5/SP6 and U2-16 are digested with the
restriction endonucleases NsiI and NdeI to excise the
portion of their inserts corresponding to nucleotides
#704 to #1876 of Fig. 5. The resulting 1173 nucleotide
NsiI-NdeI fragment of clone U2-16 is ligated into the
NsiI-NdeI site of BMP5/SP6 from which the corresponding
1173 nucleotide NsiI-NdeI fragment had been removed. The
resulting clone is designated BMP5mix/SP65.

Direct DNA sequence analysis of BMP5mix/SP65 is 
performed to confirm identity of the nucleotide sequences 
produced by the amplification to those set forth in Fig. 
5. The clone BMP5mix/SP65 is digested with the 
restriction endonuclease PstI resulting in the excision 
of an insert comprising the nucleotides #699 to #2070 of 
Fig. 5 and the additional sequences containing the PstI 
recognition sites as described above. The resulting 1382 
nucleotide PstI fragment is subcloned into the PstI site 
of the pMT2 derivative pMT21. This clone is designated 
BMP5mix/pMT21#2. 

The same fragment is also subcloned into the 
PstI site of pED4 to yield the vector designated 
BMP5mix-EMC-11. 

D. BMP-6 Vectors 

A BMP-6 cDNA sequence comprising the 
nucleotide sequence from nucleotide #160 to #1706 of 
Fig. 4 (SEQ ID NO: 7) is produced by a series of 
techniques known to those skilled in the art. The clone 
BMP6C35 [ATCC 68245] is digested with the restriction 
endonucleases ApaI and TaqI, resulting in the excision of 
a 1476 nucleotide portion of the insert comprising 
nucleotide #231 to #1703 of Fig. 4. Synthetic 
oligonucleotides with SalI restriction endonuclease site 
converters are designed to replace those nucleotides 
corresponding to #160 to #230 and #1704 to #1706 which 
are not contained in the 1476 nucleotide ApaI-TaqI 
fragment of the BMP-6 cDNA sequence. 

Oligonucleotide/SalI converters conceived to 
replace the missing 5' 
(TCGACCCACCATGCCGGGGCTGGGGCGGAGGGCGCAGTGGCTGT 
GCTGGTGGTGGGGGCTGTGCTGCAGCTGCTGCGGGCC (SEQ ID NO: 23) and 
CGCAGCAGCTGCACAGCAGCCCCCACCACCAGCACAGCCACTGCGCCCTCCGCCCCA 
GCCCCGGCATGGTGGG (SEQ ID NO: 24)) and 3' (TCGACTGGTTT 
(SEQ ID NO: 25) and CGAAACCAG (SEQ ID NO: 26)) sequences 
are annealed to each other independently. The annealed 
5' and 3' converters are then ligated to the 1476 
nucleotide ApaI-TaqI fragment described above, creating a 
1563 nucleotide fragment comprising the nucleotide 
sequence from #160 to #1706 of Fig. 4 and the additional 
sequences contrived to create SalI restriction 
endonuclease sites at both ends. The resulting 1563 
nucleotide fragment is subcloned into the SalI site of 
pSP64 [Promega Biotech, Madison, WI]. This clone is 
designated BMP6/SP64#15. 

DNA sequence analysis of BMP6/SP64#15 is 
performed to confirm identity of the 5' and 3' sequences 
replaced by the converters to the sequence set forth in 
Fig. 4. The insert of BMP6/SP64#15 is excised by 
digestion with the restriction endonuclease SalI. The 
resulting 1563 nucleotide SalI fragment is subcloned into 
the XhoI restriction endonuclease site of pMT21 and 
designated herein as BMP6/pMT21. 
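
The way the 3' converter oligonucleotides (SEQ ID NO: 25 and 26) pair to give one SalI-compatible end and one TaqI-compatible end can be illustrated as follows. This is a minimal sketch in Python under stated assumptions, not the procedure used by the inventors. The function anneal is hypothetical and only handles the case in which the 3' end of the first strand pairs with the 5'-proximal region of the second. It relies on the known cleavage patterns of SalI (G^TCGAC, leaving a 5'-TCGA overhang) and TaqI (T^CGA, leaving a 5'-CG overhang).

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def anneal(top: str, bottom: str) -> tuple:
    """Anneal two oligonucleotides, both written 5'->3'.

    Returns (top 5' overhang, duplex region on the top strand,
    bottom 5' overhang). Hypothetical helper: it assumes the duplex
    is a suffix of `top` and a prefix of the reverse complement of `bottom`.
    """
    bottom_rc = reverse_complement(bottom)
    for size in range(min(len(top), len(bottom_rc)), 0, -1):
        if top.endswith(bottom_rc[:size]):
            top_overhang = top[: len(top) - size]
            duplex = bottom_rc[:size]
            bottom_overhang = reverse_complement(bottom_rc[size:])
            return top_overhang, duplex, bottom_overhang
    raise ValueError("the two oligonucleotides do not anneal in this geometry")


# 3' converter pair as printed in the text (SEQ ID NO: 25 and 26).
top_overhang, duplex, bottom_overhang = anneal("TCGACTGGTTT", "CGAAACCAG")
print(top_overhang)     # TCGA  (matches a SalI-cut end)
print(duplex)           # CTGGTTT
print(bottom_overhang)  # CG    (matches a TaqI-cut end)
```

The 5' converter pair is not checked here, since its sequences span several printed lines.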

The PstI site of pED4 is converted to a SalI 
site by digestion of the plasmid with PstI and ligation 
to the converter oligonucleotides: 

5'-TCGACAGGCTCGCCTGCA-3' (SEQ ID NO: 27) and 
3'-GTCCGAGCGG-5' (SEQ ID NO: 28). 

The above 1563 nucleotide SalI fragment is also subcloned 
into the SalI site of this pED4 vector, yielding the 
expression vector BMP6/EMC. 

E. BMP-7 Vectors 

A BMP-7 sequence comprising the nucleotide 
sequence from nucleotide #97 to #1402 of Fig. 3 (SEQ ID 
NO: 5) is specifically amplified as follows. The 
oligonucleotides CAGGTCGACCCACCATGCACGTGCGCTCA (SEQ ID 
NO: 29) and TCTGTCGACCTCGGAGGAGCTAGTGGC (SEQ ID NO: 30) 
are utilized as primers to allow the amplification of 
nucleotide sequence #97 to #1402 of Fig. 3 from the 
insert of clone PEH7-9 [ATCC #68182]. This procedure 
generates the insertion of the nucleotide sequence 
CAGGTCGACCCACC immediately preceding nucleotide #97 and 
the insertion of the nucleotide sequence GTCGACAGA 
immediately following nucleotide #1402. The addition of 
these sequences results in the creation of a SalI 
restriction endonuclease recognition site at each end of 
the amplified DNA fragment. The resulting amplified DNA 
product of this procedure is digested with the 
restriction endonuclease SalI and subcloned into the SalI 
site of the plasmid vector pSP64 [Promega Biotech, 
Madison, WI] resulting in BMP7/SP6#2. 

The clones BMP7/SP6#2 and PEH7-9 are digested 
with the restriction endonucleases NcoI and StuI to 
excise the portion of their inserts corresponding to 
nucleotides #363 to #1081 of Fig. 3. The resulting 719 
nucleotide NcoI-StuI fragment of clone PEH7-9 is ligated 
into the NcoI-StuI site of BMP7/SP6#2 from which the 
corresponding 719 nucleotide fragment had been removed. 
The resulting clone is designated BMP7mix/SP6. 

Direct DNA sequence analysis of BMP7mix/SP6 
confirmed identity of the 3' region to the nucleotide 
sequence from #1082 to #1402 of Fig. 3; however, the 5' 
region contained one nucleotide misincorporation. 

Amplification of the nucleotide sequence (#97 
to #1402 of Fig. 3) utilizing PEH7-9 as a template is 
repeated as described above. The resulting amplified DNA 
product of this procedure is digested with the 
restriction endonucleases SalI and PstI. This digestion 
results in the excision of a 747 nucleotide fragment 
comprising nucleotide #97 to #833 of Fig. 3 plus the 
additional sequences of the 5' priming oligonucleotide 
used to create the SalI restriction endonuclease 
recognition site described earlier. This 747 nucleotide 
SalI-PstI fragment is subcloned into a SalI-PstI digested 
pSP65 [Promega Biotech, Madison, WI] vector resulting in 
5'BMP7/SP65. DNA sequence analysis demonstrates that the 
insert of the 5'BMP7/SP65#1 comprises a sequence 
identical to nucleotide #97 to #362 of Fig. 3. 

The clones BMP7mix/SP6 and 5'BMP7/SP65 are 
digested with the restriction endonucleases SalI and 
NcoI. The resulting 3' NcoI-SalI fragment of BMP7mix/SP6 
comprising nucleotides #363 to #1402 of Fig. 3 and 5' 
SalI-NcoI fragment of 5'BMP7/SP65 comprising nucleotides 
#97 to #362 of Fig. 3 are ligated together at the NcoI 
restriction sites to produce a 1317 nucleotide fragment 
comprising nucleotides #97 to #1402 of Fig. 3 plus the 
additional sequences derived from the 5' and 3' 
oligonucleotide primers which allow the creation of SalI 
restriction sites at both ends of this fragment. 
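
The fragment sizes quoted for the BMP-7 constructs can be reproduced by simple arithmetic. The sketch below, in Python, is illustrative only. It assumes that lengths are counted along the top (coding) strand and that SalI cleaves one base into its recognition sequence (G^TCGAC). The function names are hypothetical, and the flanking sequences are those stated in the text.

```python
# Sequences added by the primers, as printed in the text.
FIVE_PRIME_ADDITION = "CAGGTCGACCCACC"   # precedes nucleotide #97
THREE_PRIME_ADDITION = "GTCGACAGA"       # follows nucleotide #1402

SALI_SITE = "GTCGAC"
SALI_TOP_STRAND_CUT = 1  # SalI cuts G^TCGAC, one base into the site


def span_length(first: int, last: int) -> int:
    """Number of nucleotides from position `first` to `last`, inclusive."""
    return last - first + 1


def retained_after_left_cut(addition: str) -> int:
    """Bases of the 5' addition left on the fragment after SalI cleavage."""
    cut = addition.index(SALI_SITE) + SALI_TOP_STRAND_CUT
    return len(addition) - cut


def retained_before_right_cut(addition: str) -> int:
    """Bases of the 3' addition left on the fragment after SalI cleavage."""
    return addition.index(SALI_SITE) + SALI_TOP_STRAND_CUT


left = retained_after_left_cut(FIVE_PRIME_ADDITION)      # TCGACCCACC -> 10
right = retained_before_right_cut(THREE_PRIME_ADDITION)  # G -> 1

sali_fragment = left + span_length(97, 1402) + right  # full SalI fragment
sali_psti_fragment = left + span_length(97, 833)      # 5' SalI-PstI fragment
nco_stu_fragment = span_length(363, 1081)             # NcoI-StuI fragment

print(sali_fragment, sali_psti_fragment, nco_stu_fragment)  # 1317 747 719
```

Under these counting assumptions the three values agree with the 1317, 747 and 719 nucleotide fragments described above.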

This 1317 nucleotide SalI fragment is 
ligated into the SalI site of the pMT2 derivative 
pMT2Cla-2. pMT2Cla-2 is constructed by digesting pMT21 
with EcoRV and XhoI, treating the digested DNA with Klenow 
fragment of DNA polymerase I and ligating ClaI linkers 
(NEBio Labs, CATCGATG). This removes bases 2171 to 2420 
starting from the HindIII site near the SV40 origin of 
replication and enhancer sequences of pMT2 and introduces 
a unique ClaI site, but leaves the adenovirus VAI gene 
intact, resulting in pMT2Cla-2. The pMT2Cla-2 clone 
carrying the 1317 nucleotide SalI fragment is designated 
BMP-7-pMT2. 

The insert of BMP-7-pMT2 is excised by 
digestion with the restriction endonuclease SalI. The 
resulting 1317 nucleotide SalI fragment is subcloned into 
the XhoI restriction endonuclease site of pMT21 to yield 
the clone BMP-7/pMT21. This SalI fragment is also 
subcloned into the SalI site of the pED4 vector in which 
the PstI site was converted into a SalI site as described 
above, resulting in the vector pBMP7/EMC#4. 
F. BMP-8 Vectors 

At present no mammalian BMP-8 vectors have 
been constructed. However, using the sequence of Figure 
6 (SEQ ID NO: 11), it is contemplated that vectors 
similar to those described above for the other BMPs may 
be readily constructed. A bacterial expression vector 
similar to the BMP-2 vector described in detail in 
Example 7 may also be constructed for BMP-8, by 
introducing a Met before the amino acid #284 Ala of Fig. 
6. This sequence of BMP-8 is inserted into the vector 
pALBP2-781 in place of the BMP-2 sequence. See Example 
7. 

G. BMP Vectors Containing the Adenosine 
Deaminase (Ada) Marker 

BMP genes were inserted into the vector 
pMT3SV2Ada [R. J. Kaufman, Meth. Enz., 185:537-566 
(1990)] to yield expression plasmids containing separate 
transcription units for the BMP cDNA gene and the 
selectable marker Ada. pMT3SV2Ada contains a polylinker 
with recognition sites for the enzymes PstI, EcoRI, SalI 
and XbaI that can be used for insertion of and expression 
of genes (i.e., BMP) in mammalian cells. In addition, the 
vector contains a second transcription unit encoding Ada 
which serves as a dominant and amplifiable marker in 
mammalian cells. 

To construct expression vectors for BMP-5, 
BMP-6 and BMP-7, individually, the same general method was 
employed. The gene for BMP-5 (Fig. 5), BMP-6 (Fig. 4) or 
BMP-7 (Fig. 3) was inserted into the polylinker 
essentially as described above for the pED4 vector. These 
vectors can be used for transfection into CHO DUKX cells 
and subsequent selection and amplification using the Ada 
marker as previously described [Kaufman et al, Proc. 
Natl. Acad. Sci. USA, 83:3136-3140 (1986)]. Since each 
such vector does not contain a DHFR gene, the resultant 
transformed cells remain DHFR negative and can be 
subsequently transfected with a second vector containing 
a different BMP in conjunction with DHFR and amplified 
with methotrexate. 

Alternatively, the pMT3SV2Ada/BMP vectors can 
be used to transfect stable CHO cell lines previously 
transfected with a different BMP gene and amplified using 
the DHFR/methotrexate system. The resultant 
transfectants can be subsequently amplified using the Ada 
system, yielding cell lines that coexpress two different 
BMP genes, and are amplified using both the DHFR and Ada 
markers. 

H. BMP-Expressing Mammalian Cell Lines 

At present, the most desirable mammalian 
cell lines for use in producing the recombinant 
homodimers and heterodimers of this invention are the 
following. These cell lines were prepared by 
conventional transformation of CHO cells using vectors 
described above. 

The BMP-2 expressing cell line 2EG5 is a 
CHO cell stably transformed with the vector 
pBMP2delta-EMC. 

The BMP-4 expressing cell line 4E9 is a 
CHO cell stably transformed with the vector pBMP4delta- 
EMC. 

The BMP-5 expressing cell line 5E10 is a 
CHO cell stably transformed with the vector 
BMP5mix-EMC-11 (at an amplification level of 2 micromolar 
MTX). 

The BMP-6 expressing cell line 6HG8 is a 
CHO cell stably transformed with the vector BMP6/EMC. 

The BMP-7 expressing cell line 7MB9 is a 
CHO cell stably transformed with the vector BMP7/pMT21. 



EXAMPLE 2 - TRANSIENT EXPRESSION OF BMP HETERODIMERS 
The heterodimers of the present invention may 
be prepared by co-expression in a transient expression 
system for screening in the assays of Example 8 by two 
different techniques as follows. 
In the first procedure, the pMT2-derived and 
EMC-derived expression plasmids described in Example 1 
and other similarly derived vectors were constructed 
which encoded, individually, BMP-2 through BMP-7, and 
transforming growth factor-beta (TGF-β1). All 
combinations of pairs of plasmids were mixed in equal 
proportion and used to co-transfect CHO cells using the 
DEAE-dextran procedure [Sompayrac and Danna, Proc. Natl. 
Acad. Sci. USA, 78:7575-7578 (1981); Luthman and 
Magnusson, Nucl. Acids Res., 11:1295-1308 (1983)]. The 
cells are grown in alpha Minimal Essential Medium (α-MEM) 
supplemented with 10% fetal bovine serum, adenosine, 
deoxyadenosine, thymidine (100 μg/ml each), pen/strep, 
and glutamine (1 mM). 

The addition of compounds such as heparin, 
suramin and dextran sulfate is desirable in growth 
medium to increase the amounts of BMP-2 present in the 
conditioned medium of CHO cells. Similarly responsive to 
such compounds is BMP-5. Therefore, it is expected that 
these compounds will be added to growth medium for any 
heterodimer containing these BMP components. Other BMPs 
may also be responsive to the effects of these compounds, 
which are believed to inhibit the interaction of the 
mature BMP molecules with the cell surface. 

The following day, fresh growth medium, with or 
without 100 μg/ml heparin, was added. Twenty-four hours 
later, conditioned medium was harvested. 
In some experiments, the conditioned medium was 
collected minus heparin for the 24-48 hour period 
post-transfection, and the same plates were then used to 
generate conditioned medium in the presence of heparin 
48-72 hours post-transfection. Controls included 
transfecting cells with expression plasmids lacking any 
BMP sequences, transfecting cells with plasmids 
containing sequences for only a single BMP, or mixing 
conditioned medium from cells transfected with a single 
BMP with conditioned medium from cells transfected with a 
different BMP. 

Characterization of the coexpressed 
heterodimer BMPs in crude conditioned media, which were 
not otherwise purified, provided the following results. 
Transiently coexpressed BMP was assayed for induction of 
alkaline phosphatase activity on W20 stromal cells, as 
described in Example 8. 

Co-expression of BMP-2 with BMP-5, BMP-6 and 
BMP-7, and BMP-4 with BMP-5, BMP-6 and BMP-7 yielded more 
alkaline phosphatase inducing activity in the W20 assay 
than either of the individual BMP homodimers alone or 
mixtures of homodimers, as shown below. Maximal activity 
(in vitro) was obtained when BMP-2 was coexpressed with 
BMP-7. Increased activity was also found with the 
heterodimers BMP-2/5; BMP-2/6; BMP-4/5; BMP-4/6; and 
BMP-4/7. 



Conditioned Medium 
TGF-β BMP-7 BMP-6 BMP-5 BMP-4 BMP-3 BMP-2 
BMP-2 33 240 99 89 53 9 29 
BMP-3 - - - - 14 
BMP-4 12 115 25 22 24 
BMP-5 - 
BMP-6 - 
BMP-7 - 
TGF-β - 

Conditioned Medium + heparin 
TGF-β BMP-7 BMP-6 BMP-5 BMP-4 BMP-3 BMP-2 
BMP-2 88 454 132 127 70 77 169 
BMP-3 7 
BMP-4 7 119 30 41 37 
BMP-5 - 
BMP-6 - 
BMP-7 - 
TGF-β - 

Units: 1 unit of activity is equivalent to that of 1 ng/ml of rhBMP-2. 
-: indicates activity below the detection limit of the assay. 
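
The enhancement produced by co-expression can be expressed as a fold increase over the corresponding single-BMP transfection. The following Python sketch is illustrative only. It uses the BMP-2 and BMP-4 rows of the two tables above and assumes that the values in each row are listed in the order of the column headings, so that the last value of each row is the single-BMP (self-paired) transfection. The names ACTIVITY, fold_over_single and best_partner are hypothetical.

```python
# Activity units transcribed from the BMP-2 and BMP-4 rows of the tables.
# Assumption: values are assigned to columns in the printed header order.
ACTIVITY = {
    "no heparin": {
        "BMP-2": {"TGF-beta": 33, "BMP-7": 240, "BMP-6": 99, "BMP-5": 89,
                  "BMP-4": 53, "BMP-3": 9, "BMP-2": 29},
        "BMP-4": {"TGF-beta": 12, "BMP-7": 115, "BMP-6": 25, "BMP-5": 22,
                  "BMP-4": 24},
    },
    "heparin": {
        "BMP-2": {"TGF-beta": 88, "BMP-7": 454, "BMP-6": 132, "BMP-5": 127,
                  "BMP-4": 70, "BMP-3": 77, "BMP-2": 169},
        "BMP-4": {"TGF-beta": 7, "BMP-7": 119, "BMP-6": 30, "BMP-5": 41,
                  "BMP-4": 37},
    },
}


def fold_over_single(condition: str, row: str, partner: str) -> float:
    """Activity of the co-transfection divided by that of `row` alone."""
    values = ACTIVITY[condition][row]
    return values[partner] / values[row]


def best_partner(condition: str, row: str) -> str:
    """Partner giving the highest activity when co-expressed with `row`."""
    values = ACTIVITY[condition][row]
    partners = {name: units for name, units in values.items() if name != row}
    return max(partners, key=partners.get)


for condition in ACTIVITY:
    for row in ACTIVITY[condition]:
        partner = best_partner(condition, row)
        fold = fold_over_single(condition, row, partner)
        print(f"{condition}: {row} + {partner} = {fold:.1f}-fold over {row} alone")
```

On these figures BMP-7 is the most effective partner for both BMP-2 and BMP-4 in either medium, consistent with the statement that maximal activity was obtained with BMP-2/7.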



These BMP combinations were subsequently expressed 
using various ratios of expression plasmids (9:1, 3:1, 
1:1, 1:3, 1:9) during the CHO cell transient 
transfection. The performance of this method using 
plasmids containing BMP-2 and plasmids containing BMP-7 
at plasmid number ratios ranging from 9:1 to 1:9, 
respectively, demonstrated that the highest activity in 
the W20 assay was obtained when approximately the same 
number of plasmids of each BMP were transfected into the 
host cell. Ratios of BMP-2 to BMP-7 plasmids of 3:1 to 
1:3, respectively, also resulted in increased activity in 
the W20 assay in comparison to host cells transfected with 
plasmids containing only a single BMP. However, these 
latter ratios produced less activity than the 1:1 ratio. 
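
Because the ratios above are plasmid number ratios, the mass of each plasmid needed for a given ratio depends on the relative plasmid sizes. The sketch below, in Python, shows one way such a mixture could be calculated. The total DNA mass (10 micrograms) and the plasmid sizes (6000 base pairs each) are hypothetical placeholders and are not taken from the text, and the function name mass_split is likewise an assumption.

```python
def mass_split(total_mass_ug: float, number_ratio: tuple, sizes_bp: tuple) -> tuple:
    """Split a total DNA mass between two plasmids for a given copy-number ratio.

    The mass of each plasmid is proportional to (copies x plasmid length).
    """
    weights = [copies * size for copies, size in zip(number_ratio, sizes_bp)]
    total_weight = sum(weights)
    return tuple(total_mass_ug * weight / total_weight for weight in weights)


# Hypothetical values for illustration only.
TOTAL_DNA_UG = 10.0
PLASMID_SIZES_BP = (6000, 6000)  # BMP-2 plasmid, BMP-7 plasmid

# The five number ratios tested in the text (BMP-2 : BMP-7).
for ratio in [(9, 1), (3, 1), (1, 1), (1, 3), (1, 9)]:
    bmp2_ug, bmp7_ug = mass_split(TOTAL_DNA_UG, ratio, PLASMID_SIZES_BP)
    print(f"{ratio[0]}:{ratio[1]} -> {bmp2_ug:.2f} ug BMP-2 plasmid, "
          f"{bmp7_ug:.2f} ug BMP-7 plasmid")
```

When the two plasmids are of equal size, the mass ratio equals the number ratio; unequal sizes shift the masses accordingly.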

Similar ratios may be determined by one of 
skill in the art for heterodimers consisting of other 
than BMP-2 and BMP-7. For example, preliminary work on 
the heterodimer formed between BMP-2 and BMP-6 has 
indicated that a preferred ratio of plasmids for 
co-transfection is 3:1, respectively. The determination of 
preferred ratios for this method is within the skill of 
the art. 

As an alternative means to transiently generate 
coexpressed BMPs, the stable CHO cell lines identified in 
Example 1 expressing each of BMP-2, BMP-4, BMP-5, BMP-6 
and BMP-7 are cocultured for one day, and are then fused 
with 46.7% polyethylene glycol (PEG). One day 
post-fusion, fresh medium is added and the heterodimers are 
harvested 24 hours later for the W20 assay, described in 
Example 8. The assay results were substantially similar 
to those described immediately above. 

Therefore, all combinations of BMP-2 or 4 
coexpressed with either BMP-5, 6 or 7 yielded greater 
activity than any of the BMP homodimers alone. In 
control experiments where each BMP homodimer was 
expressed alone and conditioned media mixed post harvest, 
the activity was always intermediate between the 
individual BMPs, demonstrating that the BMP co-expressed 
heterodimers yield higher activity than combinations of 
the individually expressed BMP homodimers. 

EXAMPLE 3 - STABLE EXPRESSION OF BMP HETERODIMERS 
A. BMP-2/7 

Based on the results of the transient assays in 
Example 2, stable cell lines were made that co-express 
BMP-2 and BMP-7. 

A preferred stable cell line, 2E7E-10, was 
obtained as follows: Plasmid DNA (a 1:1 mixture of 
pBMP-7-EMC and pBMP-2-EMC, described in Example 1) is 
transfected into CHO cells by electroporation [Neuman et 
al, EMBO J., 1:841-845 (1982)]. 

Two days later, cells are switched to selective 
medium containing 10% dialyzed fetal bovine serum and 
lacking nucleosides. Colonies expressing DHFR are 
counted 10-14 days later. Individual colonies or pools 
of colonies are expanded and analyzed for expression of 
each heterodimer BMP component RNA and protein using 
standard procedures and are subsequently selected for 
amplification by growth in increasing concentrations of 
MTX. Stepwise selection of the preferred clone, termed 
2E7E, is carried out up to a concentration of 0.5 μM MTX. 
The cell line is then subcloned and assayed for 
heterodimer 2/7 expression. 

Procedures for such assay include Western blot 
analysis to detect the presence of the component DNA, 
protein analysis and SDS-PAGE analysis of metabolically 
labelled protein, W20 assay, and analysis for cartilage 
and/or bone formation activity using the ectopic rat bone 
formation assay of Example 9. The presently preferred 
clonally-derived cell line is identified as 2E7E-10. 
This cell line secretes BMP-2/7 heterodimer proteins into 
the media containing 0.5 μM MTX. 

The CHO cell line 2E7E-10 is grown in 
Dulbecco's modified Eagle's medium (DMEM)/Ham's nutrient 
mixture F-12, 1:1 (vol/vol), supplemented with 10% fetal 
bovine serum. When the cells are 80 to 100% confluent, 
the medium is replaced with serum-free DMEM/F-12. Medium 
is harvested every 24 hours for 4 days. For protein 
production and purification the cells are cultured 
serum-free. 

While the co-expressing cell line 2E7E-10 
preliminarily appears to make lower amounts of BMP 
protein than the BMP2-expressing cell line 2EG5 described 
in Example 2, preliminary evidence suggests that the 
specific activity of the presumptive heterodimer is at 
least 5-fold greater than BMP-2 homodimer (see Example 
6). 

To construct another heterodimer producing cell 
line, the stable CHO cell line 7MB9, previously 
transfected with pBMP-7-pMT2, and which expresses BMP-7, 
is employed. 7MB9 may be amplified and selected to 2 μM 
methotrexate resistance using the DHFR/MTX system. To 
generate a stable co-expressing cell line, cell line 7MB9 
is transfected with the expression vector pBMP-2Δ-EN 
(EMC-Neo) containing BMP-2 and the neomycin resistance 
gene from the Tn5 transposable element. The resulting 
transfected stable cell line was selected for both G-418 
and MTX resistance. Individual clones were picked and 
analyzed for BMP expression, as described above. 

It is anticipated that stable cell lines co- 
expressing other combinations of BMPs which show enhanced 
activity by transient coexpression will likewise yield 
greater activity upon stable expression. 

B. BMP-2/6 

Based on the results of the transient assays in 
Example 2, stable cell lines were made that co-express 
BMP-2 and BMP-6. 

A preferred stable cell line, 12C07, was 
obtained as follows: Plasmid DNA (a 1:3 mixture of pBMP- 
6-EMC and pBMP-2-EMC, described in Example 1) is 
transfected into CHO cells by electroporation [Neuman et 
al, EMBO J., 1:841-845 (1982)]. 

Two days later, cells are switched to selective 
medium containing 10% dialyzed fetal bovine serum and 
lacking nucleosides. Colonies expressing DHFR are 
counted 10-14 days later. Individual colonies or pools 
of colonies are expanded and analyzed for expression of 
each heterodimer BMP component RNA and protein using 
standard procedures and are subsequently selected for 
amplification by growth in increasing concentrations of 
MTX. Stepwise selection of the preferred clone, termed 
12-C, is carried out up to a concentration of 2.0 μM MTX. 
The cell line is then subcloned and assayed for 
heterodimer 2/6 expression. 

Procedures for such assay include Western blot 
analysis to detect the presence of the component DNA, 
protein analysis and SDS-PAGE analysis of metabolically 
labelled protein, W20 assay, and analysis for cartilage 
and/or bone formation activity using the ectopic rat bone 
formation assay of Example 9. The presently preferred 
clonally-derived cell line is identified as 12C07. This 
cell line secretes BMP-2/6 heterodimer proteins into the 
media containing 2.0 μM MTX. 

The CHO cell line 12C07 is grown in Dulbecco's 
modified Eagle's medium (DMEM)/Ham's nutrient mixture 
F-12, 1:1 (vol/vol), supplemented with 10% fetal bovine 
serum. When the cells are 80 to 100% confluent, the 
medium is replaced with serum-free DMEM/F-12. Medium is 
harvested every 24 hours for 4 days. For protein 
production and purification the cells are cultured 
serum-free. 

While the co-expressing cell line 12C07 
preliminarily appears to make lower amounts of BMP 
protein than the BMP-2-expressing cell line 2EG5 described 
in Example 2, preliminary evidence suggests that the 
specific activity of the presumptive heterodimer is at 
least 3-5-fold greater than BMP-2 homodimer (see Example 
6). 

To construct another heterodimer producing cell 
line, the stable CHO cell line 2EG5, previously 
transfected with pBMP-2-EMC, and which expresses BMP-2, 
is employed. 2EG5 may be amplified and selected to 2 μM 
methotrexate resistance using the DHFR/MTX system. To 
generate a stable co-expressing cell line, cell line 2EG5 
is transfected with the expression vector pBMP-6-ada (ada 
deaminase) containing BMP-6 and the ADA resistance gene. 
The resulting transfected stable cell line was selected 
for both DCF and MTX resistance. Individual clones are 
picked and analyzed for BMP expression, as described 
above. 

It is anticipated that stable cell lines co- 
expressing other combinations of BMPs which show enhanced 
activity by transient coexpression will likewise yield 
greater activity upon stable expression. 

EXAMPLE 4 - PURIFICATION OF BMP-2/7 AND BMP-2/6 HETERODIMER 
The same purification procedure is used for BMP-2/6 
heterodimer and BMP-2/7 heterodimer. Conditioned media 
from cultures of cell line 2E7E-10 or 12C07 containing 
recombinantly produced BMP heterodimer 2/7 or 2/6, 
respectively, can be generated from either adherent or 
suspension cultures. For small to medium scale 
generation of coexpressed BMP, adherent cultures are 
seeded into roller bottles and allowed to grow to 
confluence in alpha-Minimal Eagles Medium [α-MEM, Gibco, 
Grand Island, NY] containing 10% dialyzed 
heat-inactivated fetal calf serum [Hazleton, Denver, PA]. 
The media is then switched to a serum-free, albumin free, 
low protein medium based on a 50:50 mixture of Dulbecco's 
Modified Eagle's medium and Ham's F-12 medium, optionally 
supplemented with 100 micrograms/ml dextran sulfate. 
Four or five daily harvests are pooled, and used to 
purify the recombinant protein. 

Conditioned medium from roller bottle cultures 
obtained as described above was thawed slowly at room 
temperature and pooled. The pH of the pooled medium was 
adjusted to pH 8.0 using 1 M Tris, pH 8.0. A column was 
poured containing Matrex Cellufine Sulfate [Amicon] and 
equilibrated in 50 mM Tris, pH 8.0. 

Upon completion of loading of the medium, the 
column was washed with buffer containing 50 mM Tris, 0.4 
M NaCl, pH 8.0 until the absorbance at 280 nm reached 
baseline. The column was then washed with 50 mM Tris, pH 
8.0 to remove NaCl from the buffer. The resin was then 
washed with 50 mM Tris, 0.2 M NaCl, 4 M Urea, pH 8.0 
until a peak had eluted. The column was then washed into 
50 mM Tris, pH 8.0 to remove the urea. 

The bound BMP-2/7 or BMP-2/6 was then eluted 
using 50 mM Tris, 0.5 M NaCl, 0.5 M Arginine, pH 8.0. 
The eluate was collected as a single pool and may be 
optionally stored frozen prior to further purification. 
This Cellufine Sulfate eluate was diluted with 14 volumes 
of 6M urea and the pH of the sample was then adjusted to 
6.0. A hydroxyapatite-Ultrogel [IBF] column was poured 
and equilibrated with 80 mM potassium phosphate, 6M urea, 
pH 6.0. 
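
The composition of the sample loaded onto the hydroxyapatite column follows from the 14-volume dilution. The Python sketch below is illustrative only. It assumes that volumes are additive, that the diluent contains urea and nothing else, and that the pH adjustment adds negligible volume. The function name after_dilution is hypothetical.

```python
def after_dilution(sample_conc: float, sample_vol: float,
                   diluent_vol: float, diluent_conc: float = 0.0) -> float:
    """Final concentration after mixing a sample with a diluent.

    Concentrations may be in any unit, as long as the sample and the
    diluent use the same one. Volumes are assumed to be additive.
    """
    total_vol = sample_vol + diluent_vol
    return (sample_conc * sample_vol + diluent_conc * diluent_vol) / total_vol


# One volume of Cellufine Sulfate eluate plus 14 volumes of 6 M urea.
SAMPLE_VOL, DILUENT_VOL = 1.0, 14.0

nacl_m = after_dilution(0.5, SAMPLE_VOL, DILUENT_VOL)       # 0.5 M NaCl in eluate
arginine_m = after_dilution(0.5, SAMPLE_VOL, DILUENT_VOL)   # 0.5 M arginine
tris_mm = after_dilution(50.0, SAMPLE_VOL, DILUENT_VOL)     # 50 mM Tris
urea_m = after_dilution(0.0, SAMPLE_VOL, DILUENT_VOL, diluent_conc=6.0)

print(f"NaCl {nacl_m * 1000:.1f} mM, arginine {arginine_m * 1000:.1f} mM, "
      f"Tris {tris_mm:.2f} mM, urea {urea_m:.1f} M")
```

Under these assumptions the 15-fold dilution brings NaCl and arginine to roughly 33 mM each, while the urea concentration is about 5.6 M.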

After the completion of sample loading, the 
column was washed with 10 bed volumes of the 
equilibration buffer. Bound BMP-2/7 or BMP-2/6 
heterodimers were eluted with 5 bed volumes of 100 mM 
potassium phosphate, 6M urea, pH 7.4. This eluate was 
loaded directly onto a Vydac C4 reverse-phase HPLC column 
equilibrated in water - 0.1% TFA. BMP-2/7 or BMP-2/6 
heterodimers were eluted with a gradient of 30-50% 
acetonitrile in water - 0.1% trifluoroacetic acid. 

Fractions containing BMPs are identified by SDS-PAGE 
in the presence or absence of reductant. The identity of 
the BMPs with respect to the heterodimers vs. homodimers 
is determined by 2D-PAGE (+/- reductant). Fractions with 
heterodimers give bands which reduce to two spots. Bands 
from homodimer fractions reduce to a single spot for each 
BMP species. 

The BMP-2/6 heterodimer subunits are analyzed 
on a protein sequenator. BMP-2/6 heterodimers of the 
following species are present: BMP-6 subunit beginning 
with amino acid #375 Ser-Ala-Ser-Ser in association with 
BMP-2 subunit beginning with amino acid #283 Gln-Ala-Lys 
or #249 Ser-Leu-His, though other less abundant species 
may be present. 

It is contemplated that the same or substantially similar 
purification techniques may be employed for any 
recombinant BMP heterodimer of this invention. It is 
also contemplated that the hydroxyapatite-Ultrogel column 
may be unnecessary and that the purification scheme may 
be modified by loading the Cellufine Sulfate eluate 
directly onto the C4 reverse-phase HPLC column without 
use of the former column for BMP-2/7 or BMP-2/6 or the 
other heterodimers of this invention. 



EXAMPLE 5 - PROTEIN CHARACTERIZATION 

Total protein secreted from the co-expressing 
cell lines is analyzed after labelling with 35S-methionine 
or by Western blot analysis using antibodies raised 
against both BMPs of the heterodimer, e.g., BMP-2 and 
BMP-7. Together with the alkaline phosphatase assays, 
the data indicate the presence of the heterodimer and 
the specific activity. The following specific details 
are directed towards data collected for the BMP-2/7 and 
BMP-2/6 heterodimers; however, by application of similar 
methods to the other heterodimers described herein, 
similar results are expected. 

A. 35S-Met labelling 

Cell lines derived by cotransfection of 
BMP2Δ-EMC and BMP7Δ-EMC expression vectors were pulsed 
with 35S-methionine for 15 minutes, and chased for 6 hours 
in serum free media in the presence or absence of 
heparin. Total secreted protein was analyzed under 
reducing conditions by PAGE and fluorography. The 
results demonstrate that several cell lines secrete both 
BMP-2 and BMP-7 protein. There is a good correlation 
between the amount of alkaline phosphatase activity and 
the amount of coexpressed protein. 

Several cell lines secrete less total BMP-2 
and 7 than the BMP-2-only expressing cell line 2EG5, 
which produces 10 μg/ml BMP-2. Cell line 2E7E-10 
(amplified at a level of 0.5 μM MTX) secretes equal 
proportions of BMP-2 and BMP-7 at about the same overall 
level of expression as the cell line 2EG5. Cell line 
2E7E-10 produces the equivalent of 600 micrograms/ml of 
BMP-2 homodimer activity in one assay. 

Total labelled protein was also analyzed on a 
two-dimensional non-reducing/reducing gel system to 
ascertain whether a heterodimer is made. Preliminary 
results demonstrate the presence of a unique spot in this 
gel system that is not found in either the BMP-2-only or 
BMP-7-only cell lines, suggesting the presence of 2/7 
heterodimer. The same gel with purified material 
produced the same results (e.g., two unique spots on the 
gel) indicative of the presence of the 2/7 heterodimer. 
The homodimer of BMP-2 produced distinct species on this 
gel system. 

In contrast to the recombinant BMP-2/7 purification, 
BMP-2 homodimers are not detected during the BMP-2/6 
preparation; however, significant amounts of BMP-6 
homodimers are found. In addition, a significant amount 
of a ~20 amino acid N-terminal truncated form of BMP-6 is 
found; this could be eliminated by the inclusion of 
protease inhibitors during cell culture. BMP-2/6 was 
found to elute two to three fractions later from C4 
RP-HPLC than did BMP-2/7. 

Amino acid sequencing indicates that the predominant 
BMP-2/7 heterodimer species comprises a mature BMP-2 
subunit [amino acid #283 (Gln)-#396 (Arg)] and a mature 
subunit of BMP-7 [#293 (Ser)-#431 (His)]. BMP-2/6 
heterodimer comprises the mature BMP-2 subunit (#283-396) 
and the mature BMP-6 subunit [#375 (Ser)-#513 (His)]. 



B. Immunoprecipitation coupled to Western blot analysis 

Conditioned media from a BMP-2-only 
(2EG5), a BMP-7-only (7MB9), or the 2E7E-10 co-expressing 
cell line were subjected to immunoprecipitation with 
either a BMP-2 or BMP-7 antibody (both conventional 
polyclonal antibodies raised in rabbits), then analyzed 
on Western blots probed with either an anti-BMP-2 or 
anti-BMP-7 antibody. The 2/7 heterodimer precipitates 
and is reactive on Western blots with both the BMP-2 and 
BMP-7 antibodies, while either BMP by itself reacts with 
its specific antibody, but not with the reciprocal 
antibody. 

It has been demonstrated using this 
strategy that a protein in the co-expressing cell line 
that is precipitated by the anti-BMP-7 antibody W33 
[Genetics Institute, Inc., Cambridge, Massachusetts] and 
reacts on a Western blot with the anti-BMP-2 antibody W12 
or W10 [Genetics Institute, Inc.] is not present in the 
BMP-2 or 7-only expressing cell lines. This experiment 
indicates that this protein species is the heterodimeric 
protein. Conversely, precipitation with W12 and probing 
with W33 yielded similar results. 

EXAMPLE 6 - SPECIFIC ACTIVITY OF HETERODIMERS 
A. In vitro Assays 

The specific activities of the BMP-2/7 or BMP-2/6 
heterodimer and the BMP-2 homodimer secreted into growth 
medium of the stable cell lines 2E7E-10 and 2EG5, and 
12C07 and 2EG5, respectively, were estimated as follows. 

The amount of BMP protein in conditioned medium 
was measured by either Western blot analysis or by 
analyzing protein secreted from 35S-methionine labelled 
cells by PAGE and fluorography. The amount of activity 
produced by the same cell lines on W20 cells using either 
the alkaline phosphatase assay or osteocalcin-induction 
assay was then estimated. The specific activity of the 
BMP was calculated from the ratio of activity to protein 
secreted into the growth medium. 

In one experiment 2E7E-10 and 2EG5 secreted
similar amounts of total BMP proteins as determined by
PAGE and fluorography. 2E7E-10 produced about 50-fold
more alkaline phosphatase inducing activity than 2EG5,
suggesting that the specific activity of the heterodimer
is about 50-fold higher than that of the homodimer.

In another experiment the amount of BMP-2
secreted by 2EG5 was about 50% higher than BMP-2/7
secreted by 2E7E-10; however, 2E7E-10 produced about
10-fold more osteocalcin-inducing activity than 2EG5. From
several different experiments of this type the specific
activity of the BMP-2/7 heterodimer is estimated to be
between 5 and 50 fold higher than that of the BMP-2 homodimer.
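
The two comparisons above can be worked through numerically. The following is a minimal sketch, assuming that specific activity is simply activity divided by secreted protein and that the approximate fold values quoted in the text are exact; the function name is hypothetical and is not part of the original disclosure.

```python
def specific_activity_ratio(activity_fold, protein_fold):
    """Fold difference in specific activity (activity per unit protein).

    activity_fold: activity of cell line A divided by activity of cell line B
    protein_fold:  secreted protein of line A divided by that of line B
    """
    if activity_fold <= 0 or protein_fold <= 0:
        raise ValueError("fold values must be positive")
    return activity_fold / protein_fold


# First experiment: similar protein levels, about 50-fold more activity.
exp1 = specific_activity_ratio(50.0, 1.0)

# Second experiment: 2EG5 secreted about 50% more protein than 2E7E-10,
# so 2E7E-10 protein is 1/1.5 of 2EG5, with about 10-fold more activity.
exp2 = specific_activity_ratio(10.0, 1.0 / 1.5)

print(round(exp1, 1), round(exp2, 1))
```

Under these assumptions the second experiment corresponds to roughly a 15-fold difference, which falls inside the stated 5 to 50 fold range.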

Figures 8 and 9 compare the activity of BMP-2
and BMP-2/7 in the W20 alkaline phosphatase and BGP (Bone
Gla Protein, osteocalcin) assays. BMP-2/7 has greatly
increased specific activity relative to BMP-2 (Figure 8).
From Figure 8, approximately 1.3 ng/ml of BMP-2/7 was
sufficient to induce 50% of the maximal alkaline
phosphatase response in W-20 cells. A comparable value
for BMP-2 is difficult to calculate, since the alkaline
phosphatase response did not maximize, but greater than
30 ng/ml is needed for a half-maximal response. BMP-2/7
thus has a 20 to 30-fold higher specific activity than
BMP-2 in the W-20 assay.

As seen in Figure 9, BMP-2/7 was also a more
effective stimulator of BGP (bone gla protein,
osteocalcin) production than BMP-2 in this experiment.
Treating W-20-17 cells with BMP-2/7 for four days
resulted in a maximal BGP response with 62 ng/ml, and 11
ng/ml elicited 50% of the maximal BGP response. In
contrast, maximal stimulation of BGP synthesis by BMP-2
was not seen with doses up to 468 ng/ml of protein. The
minimal dose of BMP-2/7 needed to elicit a BGP response
by W-20-17 cells was 3.9 ng/ml, about seven-fold less
than the 29 ng/ml required of BMP-2. These results were
consistent with the data obtained in the W-20-17 alkaline
phosphatase assays for BMP-2 and BMP-2/7.
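
The fold differences quoted for the two W-20 assays follow from simple dose ratios. The sketch below illustrates this, assuming the doses read from Figures 8 and 9 are as stated in the text; the function name is hypothetical. Because BMP-2 did not reach a maximal alkaline phosphatase response, the first ratio is only a lower bound.

```python
def fold_difference(dose_reference, dose_test):
    """Return how many times lower the test dose is than the reference dose."""
    if dose_reference <= 0 or dose_test <= 0:
        raise ValueError("doses must be positive")
    return dose_reference / dose_test


# Alkaline phosphatase: half-maximal response at about 1.3 ng/ml (BMP-2/7)
# versus more than 30 ng/ml (BMP-2), so this value is a lower bound.
alk_phos_fold_min = fold_difference(30.0, 1.3)

# BGP: minimal effective dose of 3.9 ng/ml (BMP-2/7) versus 29 ng/ml (BMP-2).
bgp_fold = fold_difference(29.0, 3.9)

print(f"{alk_phos_fold_min:.1f} {bgp_fold:.1f}")
```

The lower bound of about 23-fold is in line with the 20 to 30-fold estimate, and the BGP ratio agrees with the "about seven-fold" figure.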

Preliminary analysis indicates that BMP-2/6 has
a specific activity in vitro similar to that of BMP-2/7.
The potencies of BMP-2 and BMP-2/6 on induction of
alkaline phosphatase production in W-20 cells are compared
in Figure 12; BMP-2/6 has a higher specific
activity than BMP-2 in this assay system. These data are
in good agreement with data obtained from the in vivo
assay of BMP-2 and BMP-2/6.

B. In vivo Assay
(i) BMP-2/7

The purified BMP-2/7 and BMP-2 were tested in
the rat ectopic bone formation assay. A series of
different amounts of BMP-2/7 or BMP-2 were implanted in
triplicate in rats. After 5 and 10 days, the implants
were removed and examined histologically for the presence
of bone and cartilage. The histological scores for the
amounts of new cartilage and bone formed are summarized
in Table A.

Table A

Histological scores (C = cartilage, B = bone) for doses of
0.04, 0.02, 1.0, 5.0 and 25.0, each implanted in triplicate.

5 Day Implants, BMP-2/7:
± - ±
± 1 ±
1 ± ±
2 2 1
± - 1

5 Day Implants, BMP-2:
± ± ±
1 ± 1

10 Day Implants, BMP-2/7:
± - ±
± - ±
2 1 2
1 ± 1
2 2 2
2 3 3
1 1 2
4 4 3
± ± 2
4 4 3

10 Day Implants, BMP-2:
- ± ±
- ± -
1 1 ±
1 1 ±
1 2 1
2 3 2
2 2 2
3 3 3



The amount of BMP-2/7 required to induce cartilage and
bone in the rat ectopic assay is lower than that of BMP-2.
Histologically, the appearance of cartilage and bone
induced by BMP-2/7 and BMP-2 is identical.

(ii) BMP-2/6

The in vivo activity of BMP-2/6 was compared with
that of BMP-2 by implantation of various amounts of each
BMP for ten days in the rat ectopic bone formation assay.
The results of this study (Table B, Figure 13) indicate
that BMP-2/6, similar to BMP-2/7, has increased in vivo
activity relative to BMP-2. The specific activities of
BMP-2, BMP-6, and BMP-2/6 are compared in the ectopic
bone formation assay ten days after the proteins are
implanted. The results of these experiments are shown in
Table C and Figure 14. BMP-2/6 is a more potent inducer
of bone formation than either BMP-2 or BMP-6. The amount
of bone formation observed with BMP-2/6 was comparable
to that observed with equivalent doses of BMP-2/7. The
appearance of BMP-2/6 implants is quite similar to
implants containing BMP-2 or BMP-2/7.

Table B

Histological scores of implants of BMP-2/6 and BMP-2 in rat ectopic
assay (10 day implants).

BMP (µg)   C/B    BMP-2/6    BMP-2
0.04       C, B   - ± -
0.20       C      1 1 ±
           B      ± ± ±
1.0        C      1 3 3      1 1 ±
           B      1 2 2      1 1 ±
5.0        C      2 2 2      1 2 2
           B      2 3 3      2 2 2
25.        C      1 1 1      2 2 1
           B      3 3 3      3 3 3



Table C

Histological scores of implants of BMP-2, BMP-6, and BMP-2/6 in rat
ectopic assay (10 day implants).

BMP (µg)   C/B    BMP-2      BMP-6      BMP-2/6
0.04       C                            - - ±
           B                            - - ±
0.20       C      - - 2                 1 2 2
           B      1                     2 2 2
1.0        C      - ± ±      2 1 1      1 1 1
           B      - ± ±      1 ± ±      3 3 2
5.0        C      2 2 1      3 1 3      ± ± 1
           B      1 1 1      2 ± 1      4 5 4
25.        C      ± ± ±      ± ± ±      ± ± ±
           B      5 4 5      4 4 5      4 5 3



EXAMPLE 7 - EXPRESSION OF BMP DIMER IN E. COLI

A biologically active, homodimeric BMP-2 was
expressed in E. coli using the techniques described in
European Patent Application 433,255 with minor
modifications. Other methods disclosed in the
above-referenced European patent application may also be
employed to produce heterodimers of the present invention
from E. coli. Application of these methods to the
heterodimers of this invention is anticipated to produce
active BMP heterodimeric proteins from E. coli.

A. BMP-2 Expression Vector 

An expression plasmid pALBP2-781 (Figure
7) (SEQ ID NO: 13) was constructed containing the mature
portion of the BMP-2 (SEQ ID NO: 14) gene and other
sequences which are described in detail below. This
plasmid directed the accumulation of 5-10% of the total
cell protein as BMP-2 in an E. coli host strain, GI724,
described below.

Plasmid pALBP2-781 contains the following
principal features. Nucleotides 1-2060 contain DNA
sequences originating from the plasmid pUC-18 [Norrander
et al, Gene, 26:101-106 (1983)] including sequences
containing the gene for β-lactamase which confers
resistance to the antibiotic ampicillin in host E. coli
strains, and a colE1-derived origin of replication.
Nucleotides 2061-2221 contain DNA sequences for the major
leftward promoter (pL) of bacteriophage λ [Sanger et al,
J. Mol. Biol., 162:729-773 (1982)], including three
operator sequences, OL1, OL2 and OL3. The operators are
the binding sites for λcI repressor protein,
intracellular levels of which control the amount of
transcription initiation from pL. Nucleotides 2222-2723
contain a strong ribosome binding sequence included on a
sequence derived from nucleotides 35566 to 35472 and
38137 to 38361 from bacteriophage lambda as described in
Sanger et al, J. Mol. Biol., 162:729-773 (1982).
Nucleotides 2724-3133 contain a DNA sequence encoding
mature BMP-2 protein with an additional 62 nucleotides of
3'-untranslated sequence.

Nucleotides 3134-3149 provide a "Linker" DNA
sequence containing restriction endonuclease sites.
Nucleotides 3150-3218 provide a transcription termination
sequence based on that of the E. coli aspA gene [Takagi
et al, Nucl. Acids Res., 11:2063-2074 (1985)].
Nucleotides 3219-3623 are DNA sequences derived from
pUC-18.
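
The coordinates listed above can be checked for internal consistency. The sketch below tabulates the features as 1-based inclusive ranges and confirms that they tile the plasmid without gaps. The feature labels and helper names are hypothetical, and the composition assumed for the BMP-2 segment (an initiator ATG, 114 codons for residues #283-#396, a stop codon, then 62 nucleotides of 3'-untranslated sequence) is an assumption used only to test the stated range.

```python
# Feature coordinates (1-based, inclusive) as listed for pALBP2-781.
# The labels are illustrative shorthand, not names used in the source.
FEATURES = [
    ("pUC-18 region (bla, ori)", 1, 2060),
    ("pL promoter", 2061, 2221),
    ("ribosome binding region", 2222, 2723),
    ("mature BMP-2 + 3' UT", 2724, 3133),
    ("linker", 3134, 3149),
    ("transcription terminator", 3150, 3218),
    ("pUC-18 region", 3219, 3623),
]


def feature_lengths(features):
    """Length in nucleotides of each inclusive range."""
    return {name: end - start + 1 for name, start, end in features}


def is_contiguous(features):
    """True if each feature starts one nucleotide after the previous one ends."""
    return all(nxt[1] == cur[2] + 1 for cur, nxt in zip(features, features[1:]))


lengths = feature_lengths(FEATURES)
total_length = sum(lengths.values())

# Assumed layout of the BMP-2 segment: ATG + mature codons + stop + 62 nt.
mature_residues = 396 - 283 + 1
expected_insert = 3 * (mature_residues + 2) + 62

print(total_length, lengths["mature BMP-2 + 3' UT"], expected_insert)
```

Under the stated assumption the 410-nucleotide BMP-2 segment is fully accounted for, and the seven ranges sum to the 3623 nucleotides of the plasmid.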

As described below, when cultured under
the appropriate conditions in a suitable E. coli host
strain, pALBP2-781 can direct the production of high
levels (approximately 10% of the total cellular protein)
of BMP-2 protein.

pALBP2-781 was transformed into the E. coli
host strain GI724 (F-, lacIq, lacPL8, ampC::λcI+) by the
procedure of Dagert and Ehrlich, Gene, 6:23 (1979). [The
untransformed host strain E. coli GI724 was deposited
with the American Type Culture Collection, 12301 Parklawn
Drive, Rockville, Maryland on January 31, 1991 under ATCC
No. 55151 for patent purposes pursuant to applicable laws
and regulations.] Transformants were selected on 1.5%
w/v agar plates containing IMC medium, which is composed
of M9 medium [Miller, "Experiments in Molecular
Genetics", Cold Spring Harbor Laboratory, New York
(1972)] supplemented with 0.5% w/v glucose, 0.2% w/v
casamino acids and 100 µg/ml ampicillin.

GI724 contains a copy of the wild-type λcI
repressor gene stably integrated into the chromosome at
the ampC locus, where it has been placed under the
transcriptional control of Salmonella typhimurium trp
promoter/operator sequences. In GI724, λcI protein is
made only during growth in tryptophan-free media, such as
minimal media or a minimal medium supplemented with
casamino acids such as IMC, described above. Addition of
tryptophan to a culture of GI724 will repress the trp
promoter and turn off synthesis of λcI, gradually causing
the induction of transcription from pL promoters if they
are present in the cell.
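
The regulatory circuit just described can be summarized as two steps of boolean logic. The following is a deliberately simplified steady-state sketch; it ignores the gradual decay of existing repressor after tryptophan addition, and the function names are hypothetical.

```python
def repressor_made(tryptophan_present):
    """The repressor gene sits behind the trp promoter, which is
    repressed when tryptophan is present in the medium."""
    return not tryptophan_present


def pl_transcription_on(tryptophan_present, has_pl_plasmid=True):
    """pL is transcribed only when a pL plasmid is present and the
    repressor is no longer being made."""
    return has_pl_plasmid and not repressor_made(tryptophan_present)


# Growth in tryptophan-free medium: repressor made, pL silent.
print(pl_transcription_on(tryptophan_present=False))
# After tryptophan addition: repressor synthesis stops, pL induced.
print(pl_transcription_on(tryptophan_present=True))
```

In this simplified picture, adding tryptophan acts as the inducer of expression from pL.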

GI724 transformed with pALBP2-781 was
grown at 37°C to an A550 of 0.5 (absorbance at 550 nm) in
IMC medium. Tryptophan was added to a final
concentration of 100 µg/ml and the culture incubated for
a further 4 hours. During this time BMP-2 protein
accumulated to approximately 10% of the total cell
protein, all in the "inclusion body" fraction.

BMP-2 is recovered in a non-soluble,
monomeric form as follows. Cell disruption and recovery
are performed at 4°C. Approximately 9 g of the wet
fermented E. coli GI724/pALBP2-781 cells are suspended in
30 mL of 0.1 M Tris/HCl, 10 mM EDTA, 1 mM phenyl methyl
sulphonyl fluoride (PMSF), pH 8.3 (disruption buffer).
The cells are passed four times through a cell disrupter
and the volume is brought to 100 mL with the disruption
buffer. The suspension is centrifuged for 20 min.
(15,000 x g). The pellet obtained is suspended in 50 mL
disruption buffer containing 1 M NaCl and centrifuged for
10 min. as above. The pellet is suspended in 50 mL
disruption buffer containing 1% Triton X-100 (Pierce) and
again centrifuged for 10 min. as above. The washed
pellet is then suspended in 25 mL of 20 mM Tris/HCl, 1 mM
EDTA, 1 mM PMSF, 1% DTT, pH 8.3 and homogenized in a
glass homogenizer. The resulting suspension contains
crude monomeric BMP-2 in a non-soluble form.

Ten mL of the BMP-2 suspension, obtained
as described above, are acidified with 10% acetic acid to
pH 2.5 and centrifuged in an Eppendorf centrifuge for 10
min. at room temperature. The supernatant is
chromatographed on a
Sephacryl S-100 HR column (Pharmacia, 2.6 x 83 cm) in 1%
acetic acid at a flow rate of 1.4 mL/minute. Fractions
containing monomeric BMP-2 are pooled. This material is
used to generate biologically active, homodimeric BMP-2.

Biologically active, homodimeric BMP-2 can
be generated from the monomeric BMP-2 obtained following
solubilization and purification, described above, as
follows.

0.1, 0.5 or 2.5 mg of the BMP-2 is
dissolved at a concentration of 20, 100 or 500 µg/mL,
respectively, in 50 mM Tris/HCl, pH 8.0, 1 M NaCl, 5 mM
EDTA, 2 mM reduced glutathione, 1 mM oxidized glutathione
and 33 mM CHAPS [Calbiochem]. After 4 days at 4°C or
23°C, the mixture is diluted 5 to 10 fold with 0.1% TFA.
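
The three mass and concentration pairings above imply the same refolding volume in each case. The sketch below makes that arithmetic explicit, reading the concentration unit as micrograms per mL; the helper name is hypothetical.

```python
def refolding_volume_ml(mass_mg, conc_ug_per_ml):
    """Volume of refolding buffer needed to reach the target concentration."""
    if mass_mg <= 0 or conc_ug_per_ml <= 0:
        raise ValueError("mass and concentration must be positive")
    return mass_mg * 1000.0 / conc_ug_per_ml


# (mass in mg, concentration in micrograms per mL), paired as in the text.
pairs = [(0.1, 20.0), (0.5, 100.0), (2.5, 500.0)]
volumes = [refolding_volume_ml(m, c) for m, c in pairs]

# Volume range after the 5 to 10 fold dilution with 0.1% TFA.
diluted = [(v * 5, v * 10) for v in volumes]

print(volumes, diluted[0])
```

Each pairing corresponds to about 5 mL of refolding mixture, so only the protein concentration varies between the three conditions.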

Purification of biologically active BMP-2
is achieved by subjecting the diluted mixture to reverse
phase HPLC on a Vydac C4 214TP54 column (25 x 0.46 cm)
[The NEST Group, USA] at a flow rate of 1 ml/minute.
Buffer A is 0.1% TFA. Buffer B is 90% acetonitrile and
0.1% TFA. The linear gradient was 0 to 5 minutes at 20%
Buffer B; 5 to 10 minutes at 20 to 30% Buffer B; 10 to
40 minutes at 30 to 60% Buffer B; and 40 to 50 minutes at
60 to 100% Buffer B. Homodimeric BMP-2 is eluted and
collected from the HPLC column.
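
The gradient program above is piecewise linear, so the Buffer B fraction at any time can be obtained by interpolation between the stated breakpoints. The following sketch assumes straight-line segments between breakpoints and ignores instrument dwell volume; the names are hypothetical.

```python
# (time in minutes, percent Buffer B) breakpoints from the program above.
GRADIENT = [(0.0, 20.0), (5.0, 20.0), (10.0, 30.0), (40.0, 60.0), (50.0, 100.0)]


def percent_b(t_min, program=GRADIENT):
    """Percent Buffer B at time t_min, by linear interpolation."""
    if t_min < program[0][0] or t_min > program[-1][0]:
        raise ValueError("time outside gradient program")
    for (t0, b0), (t1, b1) in zip(program, program[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise ValueError("time outside gradient program")


def percent_acetonitrile(t_min, b_acetonitrile=90.0):
    """Acetonitrile content of the mobile phase (Buffer B is 90% acetonitrile)."""
    return percent_b(t_min) * b_acetonitrile / 100.0


print(percent_b(25.0), percent_acetonitrile(25.0))
```

For example, midway through the 10 to 40 minute segment the mobile phase is 45% Buffer B, or about 40.5% acetonitrile.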

The HPLC fractions are lyophilized to
dryness, redissolved in sample buffer (1.5 M Tris-HCl, pH
8.45, 12% glycerol, 4% SDS, 0.0075% Serva Blue G, 0.0025%
Phenol Red, with or without 100 mM dithiothreitol) and
heated for five minutes at 95°C. The running buffer is
100 mM Tris, 100 mM tricine (16% tricine gel) [Novex],
0.1% SDS at pH 8.3. The SDS-PAGE gel is run at 125 volts
for 2.5 hours.

The gel is stained for one hour with 200
ml of 0.5% Coomassie Brilliant Blue R-250, 25%
isopropanol, 10% acetic acid, heated to 60°C. The gel is
then destained with 10% acetic acid, 10% isopropanol
until the background is clear.

The reduced material ran at approximately
13 kD; the non-reduced material ran at approximately 30
kD, which is indicative of the BMP-2 dimer. This
material was later found to be active in the W20 assay of Example 8.

B. BMP-7 Expression Vector

For high level expression of BMP-7 a
plasmid pALBMP7-981 was constructed. pALBMP7-981 is
identical to plasmid pALBP2-781 with two exceptions: the
BMP-2 gene (residues 2724-3133 of pALBP2-781) is replaced
by the mature portion of the BMP-7 gene, deleted for
sequences encoding the first seven residues of the mature
BMP-7 protein sequence:

ATGTCTCATAATC GTTCTAAAAC TCCAAAAAAT CAAGAAGCTC TGCGTATGGC
CAACGTGGCA GAGAACAGCA GCAGCGACCA GAGGCAGGCC TGTAAGAAGC
ACGAGCTGTA TGTCAGCTTC CGAGACCTGG GCTGGCAGGA CTGGATCATC
GCGCCTGAAG GCTACGCCGC CTACTACTGT GAGGGGGAGT GTGCCTTCCC
TCTGAACTCC TACATGAACG CCACCAACCA CGCCATCGTG CAGACGCTGG
TCCACTTCAT CAACCCGGAA ACGGTGCCCA AGCCCTGCTG TGCGCCCACG
CAGCTCAATG CCATCTCCGT CCTCTACTTC GATGACAGCT CCAACGTCAT
CCTGAAGAAA TACAGAAACA TGGTGGTCCG GGCCTGTGGC TGCCACTAGC
TCCTCCGAGA ATTCAGACCC TTTGGGGCCA AGTTTTTCTG GATCCT



and the ribosome binding site found between residues
2707 and 2723 in pALBP2-781 is replaced by a different
ribosome binding site, based on that found preceding the
T7 phage gene 10, of sequence 5'-CAAGAAGGAGATATACAT-3'.
The host strain and growth conditions used for the
production of BMP-7 were as described for BMP-2.

C. BMP-3 Expression Vector

For high level expression of BMP-3 a 
plasmid pALB3-782 was constructed. This plasmid is 
identical to plasmid pALBP2-781, except that the BMP-2 
gene (residues 2724-3133 of pALBP2-781) is replaced by a 
gene encoding a form of mature BMP-3. The sequence of 
this BMP-3 gene is: 
ATGCGTAAAC AATGGATTGA ACCACGTAAC TGTGCTCGTC GTTATCTGAA
AGTAGACTTT GCAGATATTG GCTGGAGTGA ATGGATTATC TCCCCCAAGT
CCTTTGATGC CTATTATTGC TCTGGAGCAT GCCAGTTCCC CATGCCAAAG
TCTTTGAAGC CATCAAATCA TGCTACCATC CAGAGTATAG TGAGAGCTGT
GGGGGTCGTT CCTGGGATTC CTGAGCCTTG CTGTGTACCA GAAAAGATGT
CCTCACTCAG TATTTTATTC TTTGATGAAA ATAAGAATGT AGTGCTTAAA
GTATACCCTA ACATGACAGT AGAGTCTTGC GCTTGCAGAT AACCTGGCAA
AGAACTCATT TGAATGCTTA ATTCAAT

The host strain and growth conditions used for the
production of BMP-3 were as described for BMP-2.

D. Expression of a BMP-2/7 Heterodimer in E. coli

Denatured and purified E. coli BMP-2 and BMP-7
monomers were isolated from E. coli inclusion body
pellets by acidification and gel filtration as
described above. 125 ug of each BMP in 1%
acetic acid were mixed and taken to dryness in a speed
vac. The material was resuspended in 2.5 ml 50 mM Tris,
1.0 M NaCl, 5 mM EDTA, 33 mM CHAPS, 2 mM glutathione
(reduced), 1 mM glutathione (oxidized), pH 8.0. The
sample was incubated at 23°C for one week.

The BMP-2/7 heterodimer was isolated by
HPLC on a 25 x 0.46 cm Vydac C4 column. The sample was
centrifuged in a microfuge for 5 minutes, and the
supernatant was diluted with 22.5 ml 0.1% TFA.

A buffer : 0.1% TFA
B buffer : 0.1% TFA, 95% acetonitrile
Flow rate : 1.0 ml/minute

0-5'      20% B
5-10'     20-30% B
10-90'    30-50% B
90-100'   50-100% B

By SDS-PAGE analysis, the BMP-2/7 heterodimer eluted at
about 23'.

Figure 10 is a comparison of the W-20 activity of E. 
coli BMP-2 and BMP-2/7 heterodimer, indicating greater 
activity of the heterodimer. 

F. Expression of BMP-2/3 Heterodimer in E. coli

BMP-2 and BMP-3 monomers were isolated as
follows: to 1.0 g of frozen harvested cells expressing
either BMP-2 or BMP-3 was added 3.3 ml of 100 mM Tris, 10
mM EDTA, pH 8.3. The cells were resuspended by vortexing
vigorously. 33 ul of 100 mM PMSF in isopropanol was
added and the cells lysed by one pass through a French
pressure cell. The lysate was centrifuged in a microfuge
for 20 minutes at 4°C. The supernatant was discarded.
The inclusion body pellet was taken up in 8.0 M guanidine
hydrochloride, 0.25 M DTT, 0.5 M Tris, 5 mM EDTA, pH 8.5,
and heated at 37°C for one hour.
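
The PMSF addition above corresponds to a working concentration of roughly 1 mM, as in the disruption buffer of part A. A minimal sketch of the dilution arithmetic follows, assuming the volume contributed by the 1.0 g cell pellet is ignored; the function name is hypothetical.

```python
def final_concentration_mM(stock_mM, added_ul, initial_ul):
    """Concentration after adding a small volume of stock to a larger volume."""
    if added_ul <= 0 or initial_ul < 0:
        raise ValueError("volumes must be positive")
    return stock_mM * added_ul / (initial_ul + added_ul)


# 33 ul of 100 mM PMSF added to 3.3 ml (3300 ul) of buffer.
pmsf_mM = final_concentration_mM(100.0, 33.0, 3300.0)
print(f"{pmsf_mM:.2f} mM")
```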

The reduced and denatured BMP monomers were isolated
by HPLC on a Supelco C4 guard column as follows:

A buffer : 0.1% TFA
B buffer : 0.1% TFA, 95% acetonitrile
Flow rate : 1.0 ml/minute

0-5'      1% B
5-40'     1-70% B
40-45'    70-100% B

Monomeric BMP eluted at 28-30'. Protein concentration
was estimated by A280 and the appropriate extinction
coefficient.

10 ug of BMP-2 and BMP-3 were combined and taken to
dryness in a speed vac. To this was added 50 ul of 50
mM Tris, 1.0 M NaCl, 5 mM EDTA, 33 mM CHAPS, 2 mM reduced
glutathione, 1 mM oxidized glutathione, pH 8.5. The
sample was incubated at 23°C for 3 days. The sample was
analyzed by SDS-PAGE on a 16% tricine gel under reducing
and nonreducing conditions. The BMP-2/3 heterodimer
migrated at about 35 kd nonreduced, and reduced to BMP-2
monomer at about 13 kd and BMP-3 monomer at about 21 kd.

BMP-2/3 heterodimer produced in E. coli is
tested for in vivo activity. A dose of 20 µg and an
implant time of ten days are
used to compare the in vivo activity of BMP-2/3 to
BMP-2. BMP-2/3 implants showed no cartilage or bone
forming activity, while the BMP-2 control implants showed
the predicted amounts of bone and cartilage formation.
The in vivo data obtained with BMP-2/3 are consistent with
the in vitro data from the W-20 assay.

EXAMPLE 8 - W-20 BIOASSAYS

A. Description of W-20 cells

Use of the W-20 bone marrow stromal cells
as an indicator cell line is based upon the conversion of
these cells to osteoblast-like cells after treatment with
BMP-2 [R. S. Thies et al, "Bone Morphogenetic Protein
alters W-20 stromal cell differentiation in vitro",
Journal of Bone and Mineral Research, 5(2):305 (1990);
and R. S. Thies et al, "Recombinant Human Bone
Morphogenetic Protein 2 Induces Osteoblastic
Differentiation in W-20-17 Stromal Cells", Endocrinology,
in press (1992)]. Specifically, W-20 cells are a clonal
bone marrow stromal cell line derived from adult mice by
researchers in the laboratory of Dr. D. Nathan,
Children's Hospital, Boston, MA. BMP-2 treatment of W-20
cells results in (1) increased alkaline phosphatase
production, (2) induction of PTH stimulated cAMP, and (3)
induction of osteocalcin synthesis by the cells. While
(1) and (2) represent characteristics associated with the
osteoblast phenotype, the ability to synthesize
osteocalcin is a phenotypic property only displayed by
mature osteoblasts. Furthermore, to date we have
observed conversion of W-20 stromal cells to
osteoblast-like cells only upon treatment with BMPs. In this
manner, the in vitro activities displayed by BMP treated
W-20 cells correlate with the in vivo bone forming
activity known for BMPs.

Below, two in vitro assays useful in comparison
of BMP activities of novel osteoinductive molecules are
described.

B. W-20 Alkaline Phosphatase Assay Protocol

W-20 cells are plated into 96 well tissue
culture plates at a density of 10,000 cells per well in
200 µl of media (DME with 10% heat inactivated fetal calf
serum, 2 mM glutamine and 100 U/ml penicillin + 100 µg/ml
streptomycin). The cells are allowed to attach overnight
in a 95% air, 5% CO2 incubator at 37°C.

The 200 µl of media is removed from each
well with a multichannel pipettor and replaced with an
equal volume of test sample delivered in DME with 10%
heat inactivated fetal calf serum, 2 mM glutamine and 1%
penicillin-streptomycin. Test substances are assayed in
triplicate.

The test samples and standards are allowed
a 24 hour incubation period with the W-20 indicator
cells. After the 24 hours, plates are removed from the
37°C incubator and the test media are removed from the
cells.

The W-20 cell layers are washed 3 times
with 200 µl per well of calcium/magnesium free phosphate
buffered saline and these washes are discarded.

50 µl of glass distilled water is added to
each well and the assay plates are then placed on a dry
ice/ethanol bath for quick freezing. Once frozen, the
assay plates are removed from the dry ice/ethanol bath
and thawed at 37°C. This step is repeated 2 more times
for a total of 3 freeze-thaw procedures. Once complete,
the membrane bound alkaline phosphatase is available for
measurement.

50 µl of assay mix (50 mM glycine, 0.05%
Triton X-100, 4 mM MgCl2, 5 mM p-nitrophenol phosphate, pH
10.3) is added to each assay well and the assay plates
are then incubated for 30 minutes at 37°C in a shaking
waterbath at 60 oscillations per minute.

At the end of the 30 minute incubation,
the reaction is stopped by adding 100 µl of 0.2 N NaOH to
each well and placing the assay plates on ice.

The spectrophotometric absorbance for each 
well is read at a wavelength of 405 nanometers. These 
values are then compared to known standards to give an 
estimate of the alkaline phosphatase activity in each 
sample. For example, using known amounts of p- 
nitrophenol phosphate, absorbance values are generated. 
This is shown in Table I. 



Table I

Absorbance Values for Known Standards
of P-Nitrophenol Phosphate

P-nitrophenol phosphate umoles    Mean absorbance (405 nm)

0.000                             0
0.006                             0.261 +/- .024
0.012                             0.521 +/- .031
0.018                             0.797 +/- .063
0.024                             1.074 +/- .061
0.030                             1.305 +/- .083
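
The standards in Table I are close to linear, so an absorbance reading can be converted to an amount of standard with a straight-line fit. The sketch below uses an ordinary least-squares line through the six tabulated means; it is an illustration under that assumption rather than the conversion actually used to prepare Table II, and the function names are hypothetical.

```python
# (umoles of standard, mean absorbance at 405 nm) from Table I.
STANDARDS = [
    (0.000, 0.0),
    (0.006, 0.261),
    (0.012, 0.521),
    (0.018, 0.797),
    (0.024, 1.074),
    (0.030, 1.305),
]


def fit_line(points):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def umoles_from_absorbance(a405, slope, intercept):
    """Invert the standard curve to estimate umoles from an absorbance."""
    return (a405 - intercept) / slope


slope, intercept = fit_line(STANDARDS)
estimate = umoles_from_absorbance(0.521, slope, intercept)
print(round(slope, 1), round(estimate, 4))
```

The fitted slope is close to 44 absorbance units per umole, and reading back the 0.012 umole standard returns approximately that amount.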



Absorbance values for known amounts of
BMP-2 can be determined and converted to µmoles of
p-nitrophenol phosphate cleaved per unit time as shown in
Table II.

Table II

Alkaline Phosphatase Values for W-20 Cells
Treated with BMP-2

BMP-2 concentration    Absorbance Reading    umoles substrate
ng/ml                  405 nmeters           per hour

0                      0.645                 0.024
1.56                   0.696                 0.026
3.12                   0.765                 0.029
6.25                   0.923                 0.036
12.50                  1.121                 0.044
25.0                   1.457                 0.058
50.0                   1.662                 0.067
100.0                  1.977                 0.080



These values are then used to compare the
activities of known amounts of BMP heterodimers to BMP-2
homodimer.

C. Osteocalcin RIA Protocol

W-20 cells are plated at 10^6 cells per well
in 24 well multiwell tissue culture dishes in 2 mls of
DME containing 10% heat inactivated fetal calf serum, 2
mM glutamine. The cells are allowed to attach overnight
in an atmosphere of 95% air, 5% CO2 at 37°C.

The next day the medium is changed to DME
containing 10% fetal calf serum, 2 mM glutamine and the
test substance in a total volume of 2 ml. Each test
substance is administered to triplicate wells. The test
substances are incubated with the W-20 cells for a total
of 96 hours with replacement at 48 hours by the same test
media.

At the end of 96 hours, 50 µl of the test
media is removed from each well and assayed for
osteocalcin production using a radioimmunoassay for mouse
osteocalcin. The details of the assay are described in
the kit manufactured by Biomedical Technologies Inc., 378
Page Street, Stoughton, MA 02072. Reagents for the
assay are found as product numbers BT-431 (mouse
osteocalcin standard), BT-432 (Goat anti-mouse
Osteocalcin), BT-431R (iodinated mouse osteocalcin),
BT-415 (normal goat serum) and BT-414 (donkey anti goat
IgG). The RIA for osteocalcin synthesized by W-20 cells
in response to BMP treatment is carried out as described
in the protocol provided by the manufacturer.

The values obtained for the test samples
are compared to values for known standards of mouse
osteocalcin and to the amount of osteocalcin produced by
W-20 cells in response to challenge with known amounts of
BMP-2. The values for BMP-2 induced osteocalcin
synthesis by W-20 cells are shown in Table III.

Table III

Osteocalcin Synthesis by W-20 Cells

BMP-2 concentration ng/ml    Osteocalcin Synthesis ng/well

0                            0.8
2                            0.9
4                            0.8
8                            2.2
16                           2.7
31                           3.2
62                           5.1
125                          6.5
250                          8.2
500                          9.4
1000                         10.0
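
A dose giving a half-maximal response can be estimated from tabulated dose-response values such as those in Table III by interpolating on a logarithmic dose scale. The sketch below treats the zero-dose value as the baseline and the highest tabulated value as the maximum, which is an assumption because the response may not have reached a true plateau; the function name is hypothetical.

```python
import math

# (BMP-2 in ng/ml, osteocalcin in ng/well) from Table III.
DOSE_RESPONSE = [
    (0, 0.8), (2, 0.9), (4, 0.8), (8, 2.2), (16, 2.7), (31, 3.2),
    (62, 5.1), (125, 6.5), (250, 8.2), (500, 9.4), (1000, 10.0),
]


def half_maximal_dose(points):
    """Estimate the dose giving a response halfway between the zero-dose
    baseline and the highest tabulated response (log-dose interpolation)."""
    baseline = points[0][1]
    top = max(y for _, y in points)
    target = baseline + (top - baseline) / 2.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if x0 > 0 and y0 <= target <= y1:
            if y1 == y0:
                return float(x0)
            frac = (target - y0) / (y1 - y0)
            log_dose = math.log(x0) + frac * (math.log(x1) - math.log(x0))
            return math.exp(log_dose)
    raise ValueError("response never crosses the half-maximal level")


ed50 = half_maximal_dose(DOSE_RESPONSE)
print(round(ed50))
```

With these values the half-maximal level (5.4 ng/well) falls between the 62 and 125 ng/ml doses.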



EXAMPLE 9 - ROSEN MODIFIED SAMPATH-REDDI ASSAY

A modified version of the rat bone formation
assay described in Sampath and Reddi, Proc. Natl. Acad.
Sci. USA, 80:6591-6595 (1983) is used to evaluate bone
and/or cartilage activity of BMP proteins. This modified
assay is herein called the Rosen-modified Sampath-Reddi
assay. The ethanol precipitation step of the
Sampath-Reddi procedure is replaced by dialyzing (if the
composition is a solution) or diafiltering (if the
composition is a suspension) the fraction to be assayed
against water. The solution or suspension is then
redissolved in 0.1% TFA, and the resulting solution added
to 20 mg of rat matrix. A mock rat matrix sample not
treated with the protein serves as a control. This
material is frozen and lyophilized and the resulting
powder enclosed in #5 gelatin capsules. The capsules are
implanted subcutaneously in the abdominal thoracic area
of 21-49 day old male Long Evans rats. The implants are
removed after 7-14 days. Half of each implant is used
for alkaline phosphatase analysis [see, A. H. Reddi et
al, Proc. Natl. Acad. Sci., 69:1601 (1972)].

The other half of each implant is fixed and
processed for histological analysis. 1 µm
glycolmethacrylate sections are stained with Von Kossa
and acid fuchsin to score the amount of induced bone and
cartilage formation present in each implant. The terms
+1 through +5 represent the area of each histological
section of an implant occupied by new bone and/or
cartilage cells and matrix. A score of +5 indicates that
greater than 50% of the implant is new bone and/or
cartilage produced as a direct result of protein in the
implant. A score of +4, +3, +2, and +1 would indicate
that greater than 40%, 30%, 20% and 10% respectively of
the implant contains new cartilage and/or bone.
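
The scoring scale just described maps the percentage of a section occupied by new tissue onto +1 through +5 by fixed thresholds. A minimal sketch of that mapping follows. The function name is hypothetical, and the return value of 0 for sections at or below 10% is a placeholder assumption, since the text does not assign a numeric score in that range.

```python
def histological_score(percent_new_tissue):
    """Map percent of the section occupied by new bone and/or cartilage
    to the +1..+5 scale (strictly greater than each threshold)."""
    if not 0 <= percent_new_tissue <= 100:
        raise ValueError("percentage must be between 0 and 100")
    for score, threshold in ((5, 50), (4, 40), (3, 30), (2, 20), (1, 10)):
        if percent_new_tissue > threshold:
            return score
    return 0  # assumed placeholder: no numeric score is defined at or below 10%


print([histological_score(p) for p in (5, 15, 25, 35, 45, 55)])
```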

The heterodimeric BMP proteins of this 
invention may be assessed for activity on this assay. 

Numerous modifications and variations in 
practice of this invention are expected to occur to those 
skilled in the art. Such modifications and variations 
are encompassed within the following claims. 
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Israel, David
               Wolfman, Neil M.

(ii) TITLE OF INVENTION: Recombinant Bone Morphogenetic Protein
     Heterodimers, Compositions and Methods of Use.

(iii) NUMBER OF SEQUENCES: 30

(iv) CORRESPONDENCE ADDRESS:
     (A) ADDRESSEE: Legal Affairs, Genetics Institute, Inc.
     (B) STREET: 87 CambridgePark Drive
     (C) CITY: Cambridge
     (D) STATE: MA
     (E) COUNTRY: USA
     (F) ZIP: 02140-2387

(v) COMPUTER READABLE FORM:
     (A) MEDIUM TYPE: Tape
     (B) COMPUTER: IBM PC compatible
     (C) OPERATING SYSTEM: PC-DOS/MS-DOS
     (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
     (A) APPLICATION NUMBER: US
     (B) FILING DATE:
     (C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
     (A) NAME: Kapinos, Ellen J.
     (B) REGISTRATION NUMBER: 32,245
     (C) REFERENCE/DOCKET NUMBER: GI-5192B

(ix) TELECOMMUNICATION INFORMATION:
     (A) TELEPHONE: 617-876-1170
     (B) TELEFAX: 617-876-5851

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
     (A) LENGTH: 1607 base pairs
     (B) TYPE: nucleic acid
     (C) STRANDEDNESS: double
     (D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:
     (A) NAME/KEY: CDS
     (B) LOCATION: 356..1543

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT       60

AACTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT      120

CCGAGCCCCA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC      180

GTGAAAAGAG AGACTGCGCG GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG      240

           CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA      300

ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG       358
                                                             Met
                                                               1

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 406
Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val Leu
5 10 15

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 454
Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe
20 25 30

GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502
Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu Val
35 40 45

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 550
Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gln
50 55 60 65

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 598
Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp
70 75 80

CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 646
Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp His
85 90 95

CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 694
Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe His
100 105 110

CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 742
His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr
115 120 125

CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790
Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe Ile
130 135 140 145

ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838
Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala Leu
150 155 160

GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 886
Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile Ile
165 170 175

AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 934
Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp
180 185 190

ACC AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA ACT TTT GAT GTC 982
Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Thr Phe Asp Val
195 200 205

ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 1030
Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His Gly
210 215 220 225

TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078
Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser Lys
230 235 240

AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126
Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser Trp
245 250 255

TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174
Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly
260 265 270

CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 1222
His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln Arg
275 280 285

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 1270
Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe
290 295 300 305

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC 1318
Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His
310 315 320

GCC TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 1366
Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu
325 330 335

AAC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC 1414
Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn
340 345 350

TCT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 1462
Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile
355 360 365

TCG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510
Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr
370 375 380 385

CAG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1563
Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
390 395

CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAA 1607



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 396 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val
1 5 10 15

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys
20 25 30

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu
35 40 45

Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys
50 55 60

Gln Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu
65 70 75 80

Asp Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp
85 90 95

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe
100 105 110

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr
115 120 125

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe
130 135 140

Ile Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala
145 150 155 160

Leu Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile
165 170 175

Ile Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu
180 185 190

Asp Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Thr Phe Asp
195 200 205

Val Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His
210 215 220

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser
225 230 235 240

Lys Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser
245 250 255

Trp Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys
260 265 270

Gly His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln
275 280 285

Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp
290 295 300

Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr
305 310 315 320

His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His
325 330 335

Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val
340 345 350

Asn Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala
355 360 365

Ile Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn
370 375 380

Tyr Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
385 390 395

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1954 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 403..1626



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 

GGTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 

ACAGTCCCCG GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 

CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 

GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 

Met Ile Pro Gly
1

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC
Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val Leu Leu Gly Gly
5 10 15 20

GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510
Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys Lys Lys Val Ala
25 30 35

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558
Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly Gln Ser His Glu
40 45 50

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met Phe Gly Leu Arg
55 60 65

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654
Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro Asp Tyr Met Arg
70 75 80

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702
Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu Glu Gln Ile His
85 90 95 100

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr
105 110 115

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn Ile Pro Gly Thr
120 125 130

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser Ile Pro
135 140 145

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 894
Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gln
150 155 160

GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942
Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His Arg Ile Asn Ile
165 170 175 180

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Ile
185 190 195

ACA CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038
Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp
200 205 210

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys
215 220 225

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134
Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His Leu His Gln Thr
230 235 240

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182
Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg Ser Leu Pro Gln
245 250 255 260

GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 1230
Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu Val Thr Phe Gly
265 270 275

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278
His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg
280 285 290

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 1326
Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys
295 300 305

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 1374
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp
310 315 320

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 1422
Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp
325 330 335

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 1470
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile
345 350 355

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518
Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile Pro Lys Ala Cys
360 365 370

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566
Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu
375 380 385

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met Val Val Glu Gly
390 395 400

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666
Cys Gly Cys Arg
405

CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 1726

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 1786

ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 1846

TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906

AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ile Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val
1 5 10 15

Leu Leu Gly Gly Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys
20 25 30

Lys Lys Val Ala Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly
35 40 45

Gln Ser His Glu Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met
50 55 60

Phe Gly Leu Arg Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro
65 70 75 80

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu
85 90 95

Glu Gln Ile His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser
100 105 110

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn
115 120 125

Ile Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu
130 135 140

Ser Ser Ile Pro Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu
145 150 155 160

Phe Arg Glu Gln Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His
165 170 175

Arg Ile Asn Ile Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro
180 185 190

Gly His Leu Ile Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn
195 200 205

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp
210 215 220

Thr Arg Glu Lys Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His
225 230 235 240

Leu His Gln Thr Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg
245 250 255

Ser Leu Pro Gln Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu
260 265 270

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg
275 280 285

Arg Ala Lys Arg Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys
290 295 300

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val
305 310 315 320

Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr
325 330 335

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr
340 345 350

Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile
355 360 365

Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu
370 375 380

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met
385 390 395 400

Val Val Glu Gly Cys Gly Cys Arg
405

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1448 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 97..1389

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG TGCGGGCCCG 

GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG CAC GTG CGC TCA CTG 

Met His Val Arg Ser Leu 
1 5 

CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA CCC CTG TTC 
Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro Leu Phe 
10 15 20 

CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC GAG GTG CAC 
Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu Val His 
25 30 35 

TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG GAG ATG 
Ser Ser Phe Ile His Arg Arg Leu Arg Ser Gln Glu Arg Arg Glu Met
40 45 50

CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC CCG CGC CCG
Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg Pro Arg Pro
55 60 65 70

CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG 354
His Leu Gln Gly Lys His Asn Ser Ala Pro Met Phe Met Leu Asp Leu
75 80 85

TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC 402
Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly Gly Gln Gly
90 95 100

TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG 450
Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gln Gly Pro Pro Leu
105 110 115

GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG 498
Ala Ser Leu Gln Asp Ser His Phe Leu Thr Asp Ala Asp Met Val Met
120 125 130

AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC 546
Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg
135 140 145 150

TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 594
Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys Ile Pro Glu Gly
155 160 165

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC CGG 642
Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Tyr Ile Arg
170 175 180

GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG GTG CTC 690
Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile Ser Val Tyr Gln Val Leu
185 190 195

CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC GAC AGC CGT 738
Gln Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser Arg
200 205 210

ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC ATC ACA GCC 786
Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp Ile Thr Ala
215 220 225 230

ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG GGC CTG CAG 834
Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu Gln
235 240 245

CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG GCG 882
Leu Ser Val Glu Thr Leu Asp Gly Gln Ser Ile Asn Pro Lys Leu Ala
250 255 260

GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC TTC ATG GTG 930
Gly Leu Ile Gly Arg His Gly Pro Gln Asn Lys Gln Pro Phe Met Val
265 270 275

GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC CGG TCC ACG 978
Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser Ile Arg Ser Thr
280 285 290

GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC AAG AAC CAG 1026

Gly Ser Lys Gln Arg Ser Gln Asn Arg Ser Lys Thr Pro Lys Asn Gln
295 300 305 310

G « GCC CTG CGG ATG GCC UC GTG GCA «0 UC AGC AGC AGC GAC CAG 1074
Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gln
315 320 325

S SK S 5 s s s s s - « - = 3 5 s 1122
Arg Gln Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu
330 335 340

GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC GCC TAC TAC 1170
Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr
345 350 355

TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC ACC 1218
Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr
360 365 370

sssssassssssssas 1266
Asn His Ala Ile Val Gln Thr Leu Val His Phe Ile Asn Pro Glu Thr
375 380 385 390

GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC ATC TCC GTC 1314
Val Pro Lys Pro Cys Cys Ala Pro Thr Gln Leu Asn Ala Ile Ser Val
395 400 405

CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA TAC AGA AAC 1362
Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn
410 415 420

ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC GAGAATTCAG 1409
Met Val Val Arg Ala Cys Gly Cys His
425 430

ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTC 1448

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 431 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala
1 5 10 15

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser
20 25 30

Leu Asp Asn Glu Val His Ser Ser Phe Ile His Arg Arg Leu Arg Ser
35 40 45

Gln Glu Arg Arg Glu Met Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu
50 55 60

Pro His Arg Pro Arg Pro His Leu Gln Gly Lys His Asn Ser Ala Pro
65 70 75 80

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly
85 90 95

Gly Pro Gly Gly Gln Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser
100 105 110

Thr Gln Gly Pro Pro Leu Ala Ser Leu Gln Asp Ser His Phe Leu Thr
115 120 125

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys
130 135 140

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu
145 150 155 160

Ser Lys Ile Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile
165 170 175

Tyr Lys Asp Tyr Ile Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile
180 185 190

Ser Val Tyr Gln Val Leu Gln Glu His Leu Gly Arg Glu Ser Asp Leu
195 200 205

Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu
210 215 220

Val Phe Asp Ile Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg
225 230 235 240

His Asn Leu Gly Leu Gln Leu Ser Val Glu Thr Leu Asp Gly Gln Ser
245 250 255

Ile Asn Pro Lys Leu Ala Gly Leu Ile Gly Arg His Gly Pro Gln Asn
260 265 270

Lys Gln Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe
275 280 285

Arg Ser Ile Arg Ser Thr Gly Ser Lys Gln Arg Ser Gln Asn Arg Ser
290 295 300

Lys Thr Pro Lys Asn Gln Glu Ala Leu Arg Met Ala Asn Val Ala Glu
305 310 315 320

Asn Ser Ser Ser Asp Gln Arg Gln Ala Cys Lys Lys His Glu Leu Tyr
325 330 335

Val Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu
340 345 350

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn
355 360 365

Ser Tyr Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu Val His
370 375 380

Phe Ile Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gln
385 390 395 400

Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile
405 410 415

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His
420 425 430

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2923 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(F) TISSUE TYPE: Human placenta

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: Stratagene catalog S936203 Human placenta cDNA library

(B) CLONE: BMP6C35

(viii) POSITION IN GENOME:

(C) UNITS: bp

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 160..1701

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1282..1698

(ix) FEATURE:

(A) NAME/KEY: mRNA

(B) LOCATION: 1..2923

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC GAGAGGTGGC 60 

GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG GCCTCGCTCC GCCGCTCCAC 120 

GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC CGGGCGGGG ATG CCG GGG CTG GGG 174 

Met Pro Gly Leu Gly 
-374 -370 

CGG AGG GCG CAG TGG CTG TGC TGG TGG TGG GGG CTG CTG TGC AGC TGC 222 
Arg Arg Ala Gln Trp Leu Cys Trp Trp Trp Gly Leu Leu Cys Ser Cys
-365 -360 -355

TGC GGG CCC CCG CCG CTG CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC 270
Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro Ala Ala Ala Ala Ala
-350 -345 -340

GCC GCC GGG GGG CAG CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG 318
Ala Ala Gly Gly Gln Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr
-335 -330 -325

GAG CAG CCG CCG CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG 366
Glu Gln Pro Pro Pro Ser Pro Gln Ser Ser Ser Gly Phe Leu Tyr Arg
-320 -315 -310

CGG CTC AAG ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG 414
Arg Leu Lys Thr Gln Glu Lys Arg Glu Met Gln Lys Glu Ile Leu Ser
-305 -300 -295 -290

GTG CTG GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG 462
Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gln Gln
-285 -280 -275

CCG CAG CCC CCG GCG CTC CGG CAG CAG GAG GAG CAG CAG CAG CAG CAG 510
Pro Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu Gln Gln Gln Gln Gln
-270 -265 -260

CAG CTG CCT CGC GGA GAG CCC CCT CCC GGG CGA CTG AAG TCC GCG CCC 558
Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg Leu Lys Ser Ala Pro
-255 -250 -245

CTC TTC ATG CTG GAT CTG TAC AAC GCC CTG TCC GCC GAC AAC GAC GAG 606
Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser Ala Asp Asn Asp Glu
-240 -235 -230

GAC GGG GCG TCG GAG GGG GAG AGG CAG CAG TCC TGG CCC CAC GAA GCA 654
Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser Trp Pro His Glu Ala
-225 -220 -215 -210

GCC AGC TCG TCC CAG CGT CGG CAG CCG CCC CCG GGC GCC GCG CAC CCG 702
Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro Gly Ala Ala His Pro
-205 -200 -195

CTC AAC CGC AAG AGC CTT CTG GCC CCC GGA TCT GGC AGC GGC GGC GCG 750
Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser Gly Ser Gly Gly Ala
-190 -185 -180

TCC CCA CTG ACC AGC GCG CAG GAC AGC GCC TTC CTC AAC GAC GCG GAC 798
Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala Phe Leu Asn Asp Ala Asp
-175 -170 -165

ATG GTC ATG AGC TTT GTG AAC CTG GTG GAG TAC GAC AAG GAG TTC TCC 846
Met Val Met Ser Phe Val Asn Leu Val Glu Tyr Asp Lys Glu Phe Ser
-160 -155 -150

CCT CGT CAG CGA CAC CAC AAA GAG TTC AAG TTC AAC TTA TCC CAG ATT 894
Pro Arg Gln Arg His His Lys Glu Phe Lys Phe Asn Leu Ser Gln Ile
-145 -140 -135 -130

CCT GAG GGT GAG GTG GTG ACG GCT GCA GAA TTC CGC ATC TAC AAG GAC 942
Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp
-125 -120 -115

TGT GTT ATG GGG AGT TTT AAA AAC CAA ACT TTT CTT ATC AGC ATT TAT 990



WO 93/09229 


Cys Val Met Gly Ser Phe Lys Asn Gln Thr Phe Leu Ile Ser Ile Tyr
-110 -105 -100

CAA GTC TTA CAG GAG CAT CAG CAC AGA GAC TCT GAC CTG TTT TTG TTG 1038
Gln Val Leu Gln Glu His Gln His Arg Asp Ser Asp Leu Phe Leu Leu
-95 -90 -85

GAC ACC CGT GTA GTA TGG GCC TCA GAA GAA GGC TGG CTG GAA TTT GAC 1086
Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp
-80 -75 -70

ATC ACG GCC ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG 1134
Ile Thr Ala Thr Ser Asn Leu Trp Val Val Thr Pro Gln His Asn Met
-65 -60 -55 -50

GGG CTT CAG CTG AGC GTG GTG ACA AGG GAT GGA GTC CAC GTC CAC CCC 1182
Gly Leu Gln Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro
-45 -40 -35

CGA GCC GCA GGC CTG GTG GGC AGA GAC GGC CCT TAC GAT AAG CAG CCC 1230
Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gln Pro
-30 -25 -20

TTC ATG GTG GCT TTC TTC AAA GTG AGT GAG GTC CAC GTG CGC ACC ACC 1278
Phe Met Val Ala Phe Phe Lys Val Ser Glu Val His Val Arg Thr Thr
-15 -10 -5

AGG TCA GCC TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC TCT ACC 1326
Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser Arg Asn Arg Ser Thr
1 5 10 15

CAG TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT TAC AAC AGC 1374
Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala Ser Asp Tyr Asn Ser
20 25 30

AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG TAT GTG AGT TTC 1422
Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu Tyr Val Ser Phe
35 40 45

CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA CCC AAG GGC TAT GCT 1470
Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Lys Gly Tyr Ala
50 55 60

GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC CCA CTC AAC GCA CAC ATG 1518
Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu Asn Ala His Met
65 70 75

AAT GCA ACC AAC CAC GCG ATT GTG CAG ACC TTG GTT CAC CTT ATG AAC 1566
Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu Val His Leu Met Asn
80 85 90 95

CCC GAG TAT GTC CCC AAA CCG TGC TGT GCG CCA ACT AAG CTA AAT GCC 1614
Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala
100 105 110

ATC TCG GTT CTT TAC TTT GAT GAC AAC TCC AAT GTC ATT CTG AAA AAA 1662
Ile Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn Val Ile Leu Lys Lys
115 120 125

TAC AGG AAT ATG GTT GTA AGA GCT TGT GGA TGC CAC TAACTCGAAA 1708
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His
130 135 140

CCAGATGCTG GGGACACACA TTCTGCCTTG GATTCCTAGA TTACATCTGC CTTAAAAAAA 1768
CACGGAAGCA CAGTTGGAGG TGGGACGATG AGACTTTGAA ACTATCTCAT GCCAGTGCCT 1828
TATTACCCAG GAAGATTTTA AAGGACCTCA TTAATAATTT GCTCACTTGG TAAATGACGT 1888
GAGTAGTTGT TGGTCTGTAG CAAGCTGAGT TTGGATGTCT GTAGCATAAG GTCTGGTAAC 1948
TGCAGAAACA TAACCGTGAA GCTCTTCCTA CCCTCCTCCC CCAAAAACCC ACCAAAATTA 2008
GTTTTAGCTG TAGATCAAGC TATTTGGGGT GTTTGTTAGT AAATAGGGAA AATAATCTCA 2068
AAGGAGTTAA ATGTATTCTT GGCTAAAGGA TCAGCTGGTT CAGTACTGTC TATCAAAGGT 2128
AGATTTTACA GAGAACAGAA ATCGGGGAAG TGGGGGGAAC GCCTCTGTTC AGTTCATTCC 2188
CAGAAGTCCA CAGGACGCAC AGCCCAGGCC ACAGCCAGGG CTCCACGGGG CGCCCTTGTC 2248
TCAGTCATTG CTGTTGTATG TTCGTGCTGG AGTTTTGTTG GTGTGAAAAT ACACTTATTT 2308
CAGCCAAAAC ATACCATTTC TACACCTCAA TCCTCCATTT GCTGTACTCT TTGCTAGTAC 2368
CAAAAGTAGA CTGATTACAC TGAGGTGAGG CTACAAGGGG TGTGTAACCG TGTAACACGT 2428
GAAGGCAGTG CTCACCTCTT CTTTACCAGA ACGGTTCTTT GACCAGCACA TTAACTTCTG 2488
GACTGCCGGC TCTAGTACCT TTTCAGTAAA GTGGTTCTCT GCCTTTTTAC TATACAGCAT 2548
ACCACGCCAC AGGGTTAGAA CCAACGAAGA AAATAAAATG AGGGTGCCCA GCTTATAAGA 2608
ATGGTGTTAG GGGGATGAGC ATGCTGTTTA TGAACGGAAA TCATGATTTC CCTGTAGAAA 2668
GTGAGGCTCA GATTAAATTT TAGAATATTT TCTAAATGTC TTTTTCACAA TCATGTGACT 2728
GGGAAGGCAA TTTCATACTA AACTGATTAA ATAATACATT TATAATCTAC AACTGTTTGC 2788
ACTTACAGCT TTTTTTGTAA ATATAAACTA TAATTTATTG TCTATTTTAT ATCTGTTTTG 2848
CTGTGGCGTT GGGGGGGGGG CCGGGCTTTT GGGGGGGGGG GTTTGTTTGG GGGGTGTCGT 2908
GGTGTGGGCG GGCGG 2923

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 513 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Pro Gly Leu Gly Arg Arg Ala Gln Trp Leu Cys Trp Trp Trp Gly
-374 -370 -365 -360

Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro
-355 -350 -345

Ala Ala Ala Ala Ala Ala Ala Gly Gly Gln Leu Leu Gly Asp Gly Gly
-340 -335 -330

Ser Pro Gly Arg Thr Glu Gln Pro Pro Pro Ser Pro Gln Ser Ser Ser
-325 -320 -315

Gly Phe Leu Tyr Arg Arg Leu Lys Thr Gln Glu Lys Arg Glu Met Gln
-310 -305 -300 -295

Lys Glu Ile Leu Ser Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu
-290 -285 -280

His Gly Leu Gln Gln Pro Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu
-275 -270 -265

Gln Gln Gln Gln Gln Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg
-260 -255 -250

Leu Lys Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser
-245 -240 -235

Ala Asp Asn Asp Glu Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser
-230 -225 -220 -215

Trp Pro His Glu Ala Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro
-210 -205 -200

Gly Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser
-195 -190 -185

Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala Phe
-180 -175 -170

Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu Tyr
-165 -160 -155

Asp Lys Glu Phe Ser Pro Arg Gln Arg His His Lys Glu Phe Lys Phe
-150 -145 -140 -135

Asn Leu Ser Gln Ile Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe
-130 -125 -120

Arg Ile Tyr Lys Asp Cys Val Met Gly Ser Phe Lys Asn Gln Thr Phe
-115 -110 -105

Leu Ile Ser Ile Tyr Gln Val Leu Gln Glu His Gln His Arg Asp Ser
-100 -95 -90

Asp Leu Phe Leu Leu Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly
-85 -80 -75

Trp Leu Glu Phe Asp Ile Thr Ala Thr Ser Asn Leu Trp Val Val Thr
-70 -65 -60 -55

Pro Gln His Asn Met Gly Leu Gln Leu Ser Val Val Thr Arg Asp Gly
-50 -45 -40



Val His Val His Pro Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro
-35 -30 -25

Tyr Asp Lys Gln Pro Phe Met Val Ala Phe Phe Lys Val Ser Glu Val
-20 -15 -10

His Val Arg Thr Thr Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser
-5 1 5 10

Arg Asn Arg Ser Thr Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala
15 20 25

Ser Asp Tyr Asn Ser Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu
30 35 40

Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala
45 50 55

Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro
60 65 70

Leu Asn Ala His Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu
75 80 85 90

Val His Leu Met Asn Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro
95 100 105

Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn
110 115 120

Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys
125 130 135

His



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: U2-OS osteosarcoma 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U2-OS human osteosarcoma cDNA library

(B) CLONE: U2-16

(viii) POSITION IN GENOME:

(C) UNITS: bp

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 699..2063

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1647..2060

(ix) FEATURE:

(A) NAME/KEY: mRNA

(B) LOCATION: 1..2153



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA GGGATTGAAT 60

GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC ATTTACTTGA ATAGTACAAC 120

CTAGAGTATT ATTTTACACT AAGACGACAC AAAAGATGTT AAAGTTATCA CCAAGCTGCC 180

GGACAGATAT ATATTCCAAC ACCAAGGTGC AGATCAGCAT AGATCTGTGA TTCAGAAATC 240

AGGATTTGTT TTGGAAAGAG CTCAAGGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 300

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC AAAGGCCTGA 360

TTATCATAAA TTCATATAGG AATGCATAGG TCATCTGATC AAATAATATT AGCCGTCTTC 420

TGCTACATCA ATGCAGCAAA AACTCTTAAC AACTGTGGAT AATTGGAAAT CTGAGTTTCA 480

GCTTTCTTAG AAATAACTAC TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA 540

TCGGTGAGGA TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 600

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT T T TT^DT*TT TT' AAGAGGACAA 660

GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA ATG CAT CTG ACT GTA 713
Met His Leu Thr Val
-316 -315

TTT TTA CTT GGT ATT GTG GGT TTC CTC TGG AGC TGC TGG GTT CTA
Phe Leu Leu Lys Gly Ile Val Gly Phe Leu Trp Ser Cys Trp Val Leu
-310 -305 -300

GTG GGT TAT GCA AAA GGA GGT TTG GGA GAC AAT CAT GTT CAC TCC AGT 809
Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn His Val His Ser Ser
-295 -290 -285 -280

TTT ATT TAT AGA AGA CTA CGG AAC CAC GAA AGA CGG GAA ATA CAA AGG 857
Phe Ile Tyr Arg Arg Leu Arg Asn His Glu Arg Arg Glu Ile Gln Arg
-275 -270 -265

GAA ATT CTC TCT ATC TTG GGT TTG CCT CAC AGA CCC AGA CCA TTT TCA 905
Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg Pro Arg Pro Phe Ser
-260 -255 -250

CCT GGA AAA ATG ACC AAT CAA GCG TCC TCT GCA CCT CTC TTT ATG CTG 953
Pro Gly Lys Met Thr Asn Gln Ala Ser Ser Ala Pro Leu Phe Met Leu
-245 -240 -235

GAT CTC TAC AAT GCC GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA 1001
Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val
-230 -225 -220

AGG GCA TCC TTG GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA 
Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro
-215 -210 -205 -200

GCC TCT CCC AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT
Ala Ser Pro Asn Gly Tyr Pro Arg Arg Ile Gln Leu Ser Arg Thr Thr
-195 -190 -185

CCT CTG ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC
Pro Leu Thr Thr Gln Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn
-180 -175 -170

TTT CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA
Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu
-165 -160 -155

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT CGA
Arg Asp Lys Asp Phe Ser His Gln Arg Arg His Tyr Lys Glu Phe Arg
-150 -145 -140

TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA GCT GAA
Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala Val Thr Ala Ala Glu
-135 -130 -125 -120

TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA AAT GAA ACA
Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu Asn Glu Thr
-115 -110 -105

ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC ACA AAT AGG GAT
Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu Tyr Thr Asn Arg Asp
-100 -95 -90

GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC CAA GCT TTA GAT GTG
Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala Gln Ala Leu Asp Val
-85 -80 -75

GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC AGC AAT CAT TGG GTG ATT
Gly Trp Leu Val Phe Asp Ile Thr Val Thr Ser Asn His Trp Val Ile
-70 -65 -60

AAT CCC CAG AAT AAT TTG GGC TTA CAG CTC TGT GCA GAA ACA GGG GAT
Asn Pro Gln Asn Asn Leu Gly Leu Gln Leu Cys Ala Glu Thr Gly Asp
-55 -50 -45 -40

GGA CGC AGT ATC AAC GTA AAA TCT GCT GGT CTT GTG GGA AGA CAG GGA
Gly Arg Ser Ile Asn Val Lys Ser Ala Gly Leu Val Gly Arg Gln Gly
-35 -30 -25

CCT CAG TCA AAA CAA CCA TTC ATG GTG GCC TTC TTC AAG GCG AGT GAG
Pro Gln Ser Lys Gln Pro Phe Met Val Ala Phe Phe Lys Ala Ser Glu
-20 -15 -10

GTA CTT CTT CGA TCC GTG AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC
Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys Arg Lys Asn Gln Asn
-5 1 5



CGC AAT AAA TCC AGC TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT
Arg Asn Lys Ser Ser Ser His Gln Asp Ser Ser Arg Met Ser Ser Val
10 15 20 25

GGA GAT TAT AAC ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA
Gly Asp Tyr Asn Thr Ser Glu Gln Lys Gln Ala Cys Lys Lys His Glu
30 35 40

CTC TAT GTG AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA
Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala
45 50 55

CCA GAA GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA
Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro
60 65 70

CTT AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG
Leu Asn Ala His Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu
75 80 85

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT CCA
Val His Leu Met Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala Pro
90 95 100 105

ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC TCC AAT
Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn
110 115 120

GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA TGT GGC TGC
Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ser Cys Gly Cys
125 130 135

CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT TAAGGTTTAT
His

GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met His Leu Thr Val Phe Leu Leu Lys Gly Ile Val Gly Phe Leu Trp
-316 -315 -310 -305

Ser Cys Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn
-300 -295 -290

His Val His Ser Ser Phe Ile Tyr Arg Arg Leu Arg Asn His Glu Arg
-280 -275 -270

Arg Glu Ile Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg
-265 -260 -255

Pro Arg Pro Phe Ser Pro Gly Lys Met Thr Asn Gln Ala Ser Ser Ala
-250 -245 -240

Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu
-235 -230 -225

Ser Glu Tyr Ser Val Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala
-220 -215 -210 -205

Arg Lys Gly Tyr Pro Ala Ser Pro Asn Gly Tyr Pro Arg Arg Ile Gln
-200 -195 -190

Leu Ser Arg Thr Thr Pro Leu Thr Thr Gln Ser Pro Pro Leu Ala Ser
-185 -180 -175

Leu His Asp Thr Asn Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe
-170 -165 -160

Val Asn Leu Val Glu Arg Asp Lys Asp Phe Ser His Gln Arg Arg His
-155 -150 -145

Tyr Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala
-140 -135 -130 -125

Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg
-120 -115 -110

Phe Glu Asn Glu Thr Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu
-105 -100 -95

Tyr Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala
-90 -85 -80

Gln Ala Leu Asp Val Gly Trp Leu Val Phe Asp Ile Thr Val Thr Ser
-75 -70 -65

Asn His Trp Val Ile Asn Pro Gln Asn Asn Leu Gly Leu Gln Leu Cys
-60 -55 -50 -45

Ala Glu Thr Gly Asp Gly Arg Ser Ile Asn Val Lys Ser Ala Gly Leu
-40 -35 -30

Val Gly Arg Gln Gly Pro Gln Ser Lys Gln Pro Phe Met Val Ala Phe
-25 -20 -15

Phe Lys Ala Ser Glu Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys
-10 -5 1

Arg Lys Asn Gln Asn Arg Asn Lys Ser Ser Ser His Gln Asp Ser Ser
5 10 15 20

Arg Met Ser Ser Val Gly Asp Tyr Asn Thr Ser Glu Gln Lys Gln Ala
25 30 35

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln
40 45 50

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly
55 60 65

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala
70 75 80

Ile Val Gln Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys
85 90 95 100

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe
105 110 115

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val
120 125 130

Arg Ser Cys Gly Cys His
135

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human Heart 

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: Human heart cDNA library Stratagene catalog #936208

(B) CLONE: hH38 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 8..850

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 427..843

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC
Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile
-139 -135 -130

CCG GCT GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG
Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val
-125 -120 -115 -110

CCC AGC ATC CAC CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG
Pro Ser Ile His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln
-105 -100 -95

GTG GTC CAG GAG CAG TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT
Val Val Gln Glu Gln Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp
-90 -85 -80

CTT CAG ACG CTC CGA GCT GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC
Leu Gln Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val
-75 -70 -65

ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA
Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly
-60 -55 -50

CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC GTG GAT CCT GGC
Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly
-45 -40 -35 -30

CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC
Leu Ala Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln Pro Phe
-25 -20 -15

GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG
Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg
-10 -5 1

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG
Ala Val Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu
5 10 15

CCG CAG GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC
Pro Gln Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser
20 25 30 35

CAC GGC CGG CAG GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG
His Gly Arg Gln Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln
40 45 50

GAC CTT GGC TGG CTG GAC TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC
Asp Leu Gly Trp Leu Asp Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala
55 60 65

TAT TAC TGT GAG GGG GAG TGC TCC TTC CCG CTG GAC TCC TGC ATG AAC
Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn
70 75 80

GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA
Ala Thr Asn His Ala Ile Leu Gln Ser Leu Val His Leu Met Lys Pro
85 90 95

AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC
Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr
100 105 110 115



WO 93/09229 






TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC ATC CTC CGC AAG CAC 817
Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg Lys His
120 125 130

CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC 870
Arg Asn Met Val Val Lys Ala Cys Gly Cys His
135 140

TACTGCAGCC ACCCTTCTCA TCTGGATCGG GCCCTGCAGA GGCAGAAAAC CCTTAAATGC 930
TGTCACAGCT CAAGCAGGAG TGTCAGGGGC CCTCACTCTC GGTGCCTACT TCCTGTCAGG 990
CTTCTGGGAA TTC 1003

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro Ala
-139 -135 -130 -125

Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val Pro Ser
-120 -115 -110

Ile His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln Val Val
-105 -100 -95

Gln Glu Gln Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gln
-90 -85 -80

Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala
-75 -70 -65 -60

Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg
-55 -50 -45

Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala
-40 -35 -30

Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln Pro Phe Val Val
-25 -20 -15

Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg Ala Val
-10 -5 1 5

Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu Pro Gln
10 15 20

Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser His Gly
25 30 35

Arg Gln Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln Asp Leu
40 45 50

Gly Trp Leu Asp Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala Tyr Tyr
55 60 65

Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr
70 75 80 85

Asn His Ala Ile Leu Gln Ser Leu Val His Leu Met Lys Pro Asn Ala
90 95 100

Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val
105 110 115

Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg Lys His Arg Asn
120 125 130

Met Val Val Lys Ala Cys Gly Cys His
135 140

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE:

(B) CLONE: pALBP2-781

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2724..3071

(ix) FEATURE:

(A) NAME/KEY: terminator

(B) LOCATION: 3150..3218

(ix) FEATURE:

(A) NAME/KEY: RBS

(B) LOCATION: 2222..2723

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 60
CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 120
TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 180
AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 240
TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 300
CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 360
TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC 420
TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 480
ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG 540
GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 600
ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 660
GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 720
ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 780
GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG 840
TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 900
GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT 960
CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC 1020
AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT 1080
CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA 1140
TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 1200
CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT 1260
GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 1320
TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC 1380
TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC 1440
TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 1500
GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT 1560
CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG 1620
AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 1680
GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT 1740
ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 1800
GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 1860
GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA 1920
TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT 1980
CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC 2040
CGATTCATTA ATGCAGAATT GATCTCTCAC CTACCAAACA ATGCCCCCCT GCAAAAAATA 2100
AATTCATATA AAAAACATAC AGATAACCAT CTGCGGTGAT AAATTATCTC TGGCGGTGTT 2160
GACATAAATA CCACTGGCGG TGATACTGAG CACATCAGCA GGACGCACTG ACCACCATGA 2220
AGGTGACGCT CTTAAAAATT AAGCCCTGAA GAAGGGCAGC ATTCAAAGCA GAAGGCTTTG 2280
GGGTGTGTGA TACGAAACGA AGCATTGGCC GTAAGTGCGA TTCCGGATTA GCTGCCAATG 2340
TGCCAATCGC GGGGGGTTTT CGTTCAGGAC TACAACTGCC ACACACCACC AAAGCTAACT 2400
GACAGGAGAA TCCAGATGGA TGCACAAACA CGCCGCCGCG AACGTCGCGC AGAGAAACAG 2460
GCTCAATGGA AAGCAGCAAA TCCCCTGTTG GTTGGGGTAA GCGCAAAACC AGTTCCGAAA 2520
GATTTTTTTA ACTATAAACG CTGATGGAAG CGTTTATGCG GAAGAGGTAA AGCCCTTCCC 2580
GAGTAACAAA AAAACAACAG CATAAATAAC CCCGCTCTTA CACATTCCAG CCCTGAAAAA 2640
GGGCATCAAA TTAAACCACA CCTATGGTGT ATGCATTTAT TTGCATACAT TCAATCAATT 2700

GTTATCTAAG GAAATACTTA CAT ATG CAA GCT AAA CAT AAA CAA CGT AAA 2750
Met Gln Ala Lys His Lys Gln Arg Lys
1 5

CGT CTG AAA TCT AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC AGT 2798
Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser
10 15 20 25

GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 2846
Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala
30 35 40

TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC 2894
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn
45 50 55

TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT 2942
Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser
60 65 70

AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG 2990
Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser
75 80 85

ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG 3038
Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gln
90 95 100 105

GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 3088
Asp Met Val Val Glu Gly Cys Gly Cys Arg
110 115

CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAATCTAGA GTCGACCTGC 3148
AGTAATCGTA CAGGGTAGTA CAAATAAAAA AGGCACGTCA GATGACGTGC CTTTTTTCTT 3208
GTGAGCAGTA AGCTTGGCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 3268
CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA 3328
AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCCT 3388
GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT 3448
CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC 3508
GCTGACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 3568
GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGA 3623

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gln Ala Lys His Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys
1 5 10 15

Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp
20 25 30

Ile Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys
35 40 45

Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Val
50 55 60

Gln Thr Leu Val Asn Ser Val Asn Ser Lys Ile Pro Lys Ala Cys Cys
65 70 75 80

Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu Asn
85 90 95

Glu Lys Val Val Leu Lys Asn Tyr Gln Asp Met Val Val Glu Gly Cys
100 105 110 

Gly Cys Arg 
115 
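
As a consistency check on the two listings above, the coding region given under SEQ ID NO: 13 (CDS LOCATION 2724..3071) can be translated and compared with the 115-residue sequence of SEQ ID NO: 14. The following is a minimal sketch only: the codon rows are transcribed from the sequence description above, the one-letter string `EXPECTED` is a hand transcription of SEQ ID NO: 14, and all variable names are illustrative assumptions rather than part of the disclosure.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Feature location as stated for SEQ ID NO: 13.
CDS_START, CDS_END = 2724, 3071

# Codon rows transcribed from SEQ ID NO: 13 (assumed to be copied correctly).
ROWS = [
    "ATG CAA GCT AAA CAT AAA CAA CGT AAA",
    "CGT CTG AAA TCT AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC AGT",
    "GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC",
    "TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC",
    "TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT",
    "AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG",
    "ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG",
    "GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAG",
]

# One-letter transcription of SEQ ID NO: 14 (hand-converted from three-letter codes).
EXPECTED = (
    "MQAKHKQRKRLKSSCK" "RHPLYVDFSDVGWNDW" "IVAPPGYHAFYCHGEC"
    "PFPLADHLNSTNHAIV" "QTLVNSVNSKIPKACC" "VPTELSAISMLYLDEN"
    "EKVVLKNYQDMVVEGC" "GCR"
)

codons = " ".join(ROWS).split()
protein = "".join(CODON_TABLE[c] for c in codons)
mature = protein.rstrip("*")  # drop the terminal stop codon

print("CDS length (nt):", CDS_END - CDS_START + 1)
print("codons incl. stop:", len(codons))
print("residues:", len(mature))
print("matches SEQ ID NO: 14:", mature == EXPECTED)
```

Under these assumptions the CDS span of 348 nucleotides corresponds to 115 codons plus one stop codon, which agrees with the stated LENGTH of 115 amino acids.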

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CATGGGCAGC TGAG

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAGGGTTGTG GGTGTCGCTA GTGAGTCGAC TACAGCAAAT T 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGATGTGGGT GCCGCTGACT CTAGAGTCGA CGGAATTC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AATTCACCAT GATTCCTGGT AACCGAATGC T 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GTGGTACTAA GGACCATTGG CTTAC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGACCTGCAG CCATGCATCT GACTGTA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGCCTGCAGT TTAATATTAG TGGCAGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGACCTGCAG CCACC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCGACCCACC ATGCCGGGGC TGGGGCGGAG GGCGCAGTGG CTGTGCTGGT GGTGGGGGCT 
GTGCTGCAGC TGCTGCGGGC C 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGCAGCAGCT GCACAGCAGC CCCCACCACC AGCACAGCCA CTGCGCCCTC CGCCCCAGCC 
CCGGCATGGT GGG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCGACTGGTT T 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
CGAAACCAG

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
TCGACAGGCT CGCCTGCA
(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GTCCGAGCGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
CAGGTCGACC CACCATGCAC GTGCGCTCA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
TCTGTCGACC TCGGAGGAGC TAGTGGC

WHAT IS CLAIMED IS:

1. A method for producing a heterodimeric 
protein having bone stimulating activity comprising 
culturing a selected host cell containing a sequence 
encoding a first selected BMP or fragment thereof and a 
sequence encoding a second selected BMP or fragment 
thereof, said sequences each being under the control of a 
suitable regulatory sequence capable of directing co- 
expression of said proteins, and isolating said 
heterodimeric protein from the culture medium. 

2. The method according to claim 1 wherein 
said first BMP or fragment thereof is present on a first 
vector transfected into said host cell and said second 
BMP or fragment thereof is present on a second vector 
transfected into said host cell. 

3. The method according to claim 1 wherein 
both said BMPs or fragments thereof are incorporated into 
a chromosome of said host cell. 

4. The method according to claim 1 wherein 
both BMPs or fragments thereof are present on a single 
vector. 

5. The method according to claim 2 wherein
more than a single copy of the gene encoding each said
BMP or fragment thereof is present on each vector.

6. The method according to claim 1 wherein 
said host cell is a hybrid cell prepared by culturing two 
fused selected, stable host cells, each host cell
transfected with a sequence encoding a selected first or 
second BMP or fragment thereof, said sequences under the 
control of a suitable regulatory sequence capable of 
directing expression of each protein or fragment. 

7. The method according to claim 1 wherein 
said host cell is a mammalian cell. 

8. The method according to claim 1 wherein 
said host cell is an insect cell. 

9. The method according to claim 1 wherein 
said host cell is a yeast cell. 

10. A method for producing a heterodimeric 
protein having bone stimulating activity in a bacterial 
cell comprising culturing a selected host cell containing 
a sequence encoding a first selected BMP or fragment 
thereof under the control of a suitable regulatory 
sequence capable of directing expression of the protein 
or protein fragment under conditions suitable for the
formation of a soluble, monomeric protein; culturing a
selected host cell containing a sequence encoding a
second selected BMP or fragment thereof under the control
of a suitable regulatory sequence capable of directing
expression of the protein or protein fragment under said
conditions to form a second soluble, monomeric protein; 
and mixing said soluble monomeric proteins under 
conditions permitting the formation of dimeric proteins 
associated by at least one covalent disulfide bond; 
isolating from the mixture a heterodimeric protein. 

11. The method according to claim 10 wherein 
said host cell is E. coli. 



12. The method according to claim 10 wherein
said conditions comprise treating said protein with a
solubilizing agent.



13. A recombinant heterodimeric protein having 
bone stimulating activity comprising a first protein or 
fragment of BMP-2 in association with a second protein or 
fragment thereof selected from the group consisting of 
BMP-5, BMP-6, BMP-7 and BMP-8.

14. The protein according to claim 13 wherein 
said second protein is BMP-5.

15. The protein according to claim 13 wherein
said second protein is BMP-6. 

16. The protein according to claim 13 wherein 
said second protein is BMP-7. 

17. The protein according to claim 13 wherein 
said second protein is BMP-8. 

18. A recombinant heterodimeric protein having 
bone stimulating activity comprising a protein or 
fragment of BMP-4 in association with a second protein or 
fragment thereof selected from the group consisting of 
BMP-5, BMP-6, BMP-7 and BMP-8. 

19. The protein according to claim 18 wherein 
said second protein is BMP-5. 

20. The protein according to claim 18 wherein 
said second protein is BMP-6. 

21. The protein according to claim 18 wherein 
said second protein is BMP-7. 

22. The protein according to claim 18 wherein 
said second protein is BMP-8.

23. A recombinant heterodimeric protein having
bone stimulating activity comprising a protein or
fragment of a first BMP in association with a second
protein or fragment of a second BMP produced by co-
expressing said proteins in a selected host cell.

24. The protein according to claim 23 wherein 
said first BMP is BMP-2 and said second BMP is BMP-7. 

25. A cell line comprising a nucleotide
sequence encoding a first BMP or fragment thereof under
control of a suitable expression regulatory system and a
nucleotide sequence encoding a second BMP or fragment
thereof under control of a suitable expression regulatory
system, said regulatory systems capable of directing the
co-expression of said BMPs or fragments thereof and the
formation of heterodimeric protein.

26. The cell line according to claim 25 
wherein said nucleotide sequences encoding said first and 
second BMP proteins are present in a single DNA molecule.

27. The cell line according to claim 25
wherein said nucleotide sequence encoding said first BMP
is present on a first DNA molecule and said nucleotide
sequence encoding said second BMP is present on a second
DNA molecule.

28. The cell line according to claim 26
wherein said single DNA molecule comprises a first
transcription unit containing a gene encoding a first BMP
or fragment thereof and a second transcription unit
containing a gene encoding a second BMP or fragment
thereof.

29. The cell line according to claim 26 
wherein said single DNA molecule comprises a single 
transcription unit containing multiple copies of said 
gene encoding said first BMP or fragments thereof and 
multiple copies of said gene encoding said second BMP or 
fragments thereof. 

30. A DNA molecule comprising a sequence 
encoding a first selected BMP or fragment thereof and a 
sequence encoding a second selected BMP or fragment 
thereof, said sequences under the control of at least one 
suitable regulatory sequence capable of directing co- 
expression of each BMP or fragment thereof. 

31. The molecule according to claim 30 
comprising a first transcription unit containing a gene 
encoding a first BMP or fragment thereof and a second 
transcription unit containing a gene encoding a second 
BMP or fragment thereof.

32. The molecule according to claim 30
comprising a single transcription unit containing
multiple copies of said gene encoding said first BMP or
fragments thereof and multiple copies of said gene
encoding said second BMP or fragments thereof.

33. The protein according to claim 23 wherein 
said first BMP is BMP-2 and said second BMP is BMP-6. 

34. A recombinant BMP-2 homodimer having bone
stimulating activity said homodimer produced in E. coli.

35. A method for producing a homodimeric BMP-2
protein having bone stimulating activity said method
comprising culturing E. coli host cells and isolating and
purifying said protein from the resulting culture medium.

36. A recombinant heterodimeric protein having
bone stimulating activity comprising a first protein or
fragment of BMP-2 in association with a second protein or
fragment of BMP-2.

FIGURE 1A



10 20 30 40 50 60 70 

GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT AACTTGCGCA 



80 90 100 110 120 130 140 

CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT CCGAGCCCCA CCGCCCCTCC 



150 160 170 180 190 200 210 

ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC GTGAAAAGAG AGACTGCGCG GCCGGCACCC 



220 230 240 250 260 270 280 

GGGAGAAGGA GGAGGCAAAG AAAAGGAACG GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT 



290 300 310 320 330 340 350 

TCCATGTGGA CGCTCTTTCA ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT 



(1) 370 385 400 

CGACC ATG GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC 
MET Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val

415 430 445

CTC CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC GCG

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe Ala

(24)

460 475 490 505

GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC CTG AGC GAG

Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu Val Leu Ser Glu

520 535 550 565

TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG AGA CCC ACC CCC AGC
Phe Glu Leu Arg Leu Leu Ser MET Phe Gly Leu Lys Gln Arg Pro Thr Pro Ser

580 595 610

AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC CTG TAT CGC AGG CAC TCA GGT
Arg Asp Ala Val Val Pro Pro Tyr MET Leu Asp Leu Tyr Arg Arg His Ser Gly

625 640 655 670

CAG CCG GGC TCA CCC GCC CCA GAC CAC CGG TTG GAG AGG GCA GCC AGC CGA GCC

Gln Pro Gly Ser Pro Ala Pro Asp His Arg Leu Glu Arg Ala Ala Ser Arg Ala

FIGURE 1B



685 700 715 

AAC ACT GTG CGC AGC TTC CAC CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG 
Asn Thr Val Arg Ser Phe His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr 

730 745 760 775 

AGT GGG AAA ACA ACC CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG 

Ser Gly Lys Thr Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser He Pro Thr Glu 

790 805 820 835 

GAG TTT ATC ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT 
Glu Phe Ile Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln MET Gln Asp Ala

850 865 880 

TTA GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA AAA 
Leu Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile Ile Lys

895 910 925 940

CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC ACC AGG TTG
Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp Thr Arg Leu

955 970 985 

GTG AAT CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT GTC ACC CCC GCT GTG ATG 
Val Asn Gln Asn Ala Ser Arg Trp Glu Ser Phe Asp Val Thr Pro Ala Val MET

1000 1015 1030 1045 

CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA TTC GTG GTG GAA GTG GCC CAC 

Arg Trp Thr Ala Gln Gly His Ala Asn His Gly Phe Val Val Glu Val Ala His

1060 1075 1090 1105

TTG GAG GAG AAA CAA GGT GTC TCC AAG AGA CAT GTT AGG ATA AGC AGG TCT TTG
Leu Glu Glu Lys Gln Gly Val Ser Lys Arg His Val Arg Ile Ser Arg Ser Leu

(249) 

1120 1135 1150 

CAC CAA GAT GAA CAC AGC TGG TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC 
His Gln Asp Glu His Ser Trp Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly

(266) 

1165 1180 1195 1210

CAT GAT GGA AAA GGG CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC
His Asp Gly Lys Gly His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His

(283) 

1225 1240 1255 

AAA CAG CGG AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC 
Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp

(296) 

1270 1285 1300 1315 

TTC AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 
Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala
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FIGURE 1C 
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1540(396) 1553 1563 1573 1583 1593 1603 

TGT CGC TAGTACAGCA AAATTAAATA CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA 
Cys Arg 
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FIGURE 2A 



10 20 30 40 50 60 70 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA GGTGAGTGTG

80 90 100 110 120 130 140 

GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG AGTATCTAGC TTGTCTCCCC 

150 160 170 180 190 200 210

GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC ACAGTCCCCG GCCCTCGCCC AGGTTCACTG 

220 230 240 250 260 270 280 

CAACCGTTCA GAGGTCCCCA GGAGCTGCTG CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC 

290 300 310 320 330 340 350 

GTAGTGCCAT CCCGAGCAAC GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG 

360 370 380 390 400 (1) 

CTGTCAAGAA TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT 

MET Ile Pro

417 432 447 462 

ril it* P A tl2. P G ATG GTC GTT TTA TGC CAA GTC CTG CTA GGA GGC GCG 

Gly Asn Arg MET Leu MET Val Val Leu Leu Cys Gln Val Leu Leu Gly Gly Ala

477 492 507 

«t T G f T AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC GAG ATT CAG 
Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys Lys Lys Val Ala Glu Ile Gln

522 537 552 567 

r GC . » AC G ? G G ? A GGA CGC CGC TCA GGG CAG AGC CAT GAG CTC CTG CGG GAC TTC 
Gly His Ala Gly Gly Arg Arg Ser Gly Gln Ser His Glu Leu Leu Arg Asp Phe

582 597 612 627

G f G r TT CTG CAG ATG TTT GGG CTG CGC CGC CGC CCG CAG CCT AGC AAG 

Glu Ala Thr Leu Leu Gln MET Phe Gly Leu Arg Arg Arg Pro Gln Pro Ser Lys

642 657 672 

AGT GCC GTC ATT CCG GAC TAC ATG CGG GAT CTT TAC CGG CTT CAG TCT GGG GAG 
Ser Ala Val Ile Pro Asp Tyr MET Arg Asp Leu Tyr Arg Leu Gln Ser Gly Glu
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FIGURE 2B 



687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC 
Glu Glu Glu Glu Gln Ile His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala

747 762 777 

AGC CGG GCC AAC ACC GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC 
Ser Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn Ile

792 807 822 837 

CCA GGG ACC AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC 

Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser Ile

852 867 882 897 

CCT GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG GTG 
Pro Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gln Val

912 927 942 

GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT TAT GAG GTT
Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His Arg Ile Asn Ile Tyr Glu Val

957 972 987 1002 

blZ ^ G CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC ACA CGA CTA CTG GAC 
MET Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Ile Thr Arg Leu Leu Asp

1017 1032 1047 

ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG GAA ACT TTT GAT GTG AGC CCT 
Thr Arg Leu Val His His Asn Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro 

1062 1077 1092 1107 

GCG GTC CTT CGC TGG ACC CGG GAG AAG CAG CCA AAC TAT GGG CTA GCC ATT GAG 

Ala Val Leu Arg Trp Thr Arg Glu Lys Gln Pro Asn Tyr Gly Leu Ala Ile Glu

1122 1137 1152 1167

GTG ACT CAC CTC CAT CAG ACT CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC
Val Thr His Leu His Gln Thr Arg Thr His Gln Gly Gln His Val Arg Ile Ser

1182 1197 1212

CGA TCG TTA CCT CAA GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC
Arg Ser Leu Pro Gln Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu Val

1227 1242 1257 1272 

ACC TTT GGC CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG 

Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys 

1287 1302 1317 

CGT AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC CGG 
Arg Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg

(293)
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FIGURE 2C 



1332 1347 1362 1377 

CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC TGG ATT GTG 

Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val

1392 1407 1422 1437 

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC TTT CCA CTG 
Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu

1452 1467 1482 

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT GTG CAG ACC CTG GTC AAT TCT 
Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser

1497 1512 1527 1542 

GTC AAT TCC AGT ATC CCC AAA GCC TGT TGT GTG CCC ACT GAA CTG AGT GCC ATC 
Val Asn Ser Ser Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile

1557 1572 1587 

TCC ATG CTG TAC CTG GAT GAG TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG 
Ser MET Leu Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu

1602 1617 (408) 1636 1646 1656

ATG GTA GTA GAG GGA TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG
MET Val Val Glu Gly Cys Gly Cys Arg

1666 1676 1686 1696 1706 1716 1726 

ATATACACAC CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 



1736 1746 1756 1766 1776 1786 1796 

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA ATCCCTAAAC 



1806 1816 1826 1836 1846 1856 1866 

ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT TGATCATATA TTTTGACAAA 



1876 1886 1896 1906 1916 1926 1936 

ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG AGTCATTATT TTAAAAAAAA AAAAAAAACT 



1946 

CTAGAGTCGA CGGAATTC 
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10 20 30 40 50 

GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG 



60 70 80 90 99 

TGCGGGCCCG GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG 

MET 
(1) 

108 117 126 135 144 

CAC GTG CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG 
His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

153 162 171 180 189

CTC TGG GCA CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC 
Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe 

198 207 216 225 234 

AGC CTG GAC AAC GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC 
Ser Leu Asp Asn Glu Val His Ser Ser Phe Ile His Arg Arg Leu

243 252 261 270 279 

CGC AGC CAG GAG CGG CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT 
Arg Ser Gln Glu Arg Arg Glu MET Gln Arg Glu Ile Leu Ser Ile

288 297 306 315 324 

TTG GGC TTG CCC CAC CGC CCG CGC CCG CAC CTC CAG GGC AAG CAC 
Leu Gly Leu Pro His Arg Pro Arg Pro His Leu Gln Gly Lys His

333 342 351 360 369 

AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG TAC AAC GCC ATG GCG 
Asn Ser Ala Pro MET Phe MET Leu Asp Leu Tyr Asn Ala MET Ala 

378 387 396 405 414 

GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC TTC TCC TAC CCC 
Val Glu Glu Gly Gly Gly Pro Gly Gly Gln Gly Phe Ser Tyr Pro

423 432 441 450 459 

TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG GCC AGC CTG 
Tyr Lys Ala Val Phe Ser Thr Gln Gly Pro Pro Leu Ala Ser Leu

468 477 486 495 504 

CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG AGC TTC 
Gln Asp Ser His Phe Leu Thr Asp Ala Asp MET Val MET Ser Phe

513 522 531 540 549 

GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC TAC 
Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg Tyr 
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FIGURE 3B 



558 567 576 585 594 

CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 
His His Arg Glu Phe Arg Phe Asp Leu Ser Lys Ile Pro Glu Gly

603 612 621 630 639 

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC 
Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Tyr Ile

648 657 666 675 684 

CGG GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG 
Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile Ser Val Tyr Gln

693 702 711 720 729 

GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 
Val Leu Gln Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu

738 747 756 765 774 

GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT
Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe

783 792 801 810 819

GAC ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC
Asp Ile Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His

828 837 846 855 864 

AAC CTG GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC 
Asn Leu Gly Leu Gln Leu Ser Val Glu Thr Leu Asp Gly Gln Ser

873 882 891 900 909 

ATC AAC CCC AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG 
Ile Asn Pro Lys Leu Ala Gly Leu Ile Gly Arg His Gly Pro Gln

918 927 936 945 954 

AAC AAG CAG CCC TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC 
Asn Lys Gln Pro Phe MET Val Ala Phe Phe Lys Ala Thr Glu Val

963 972 981 990 999 

CAC TTC CGC AGC ATC CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG 
His Phe Arg Ser Ile Arg Ser Thr Gly Ser Lys Gln Arg Ser Gln

(293) 

1008 1017 1026 1035 1044 

AAC CGC TCC AAG ACG CCC AAG AAC CAG GAA GCC CTG CGG ATG GCC 
Asn Arg Ser Lys Thr Pro Lys Asn Gln Glu Ala Leu Arg MET Ala

1053 1062 1071 1080 1089 

AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG AGG CAG GCC TGT AAG 
Asn Val Ala Glu Asn Ser Ser Ser Asp Gln Arg Gln Ala Cys Lys
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FIGURE 3C 



JJ2 StS^P f TG TAT ^^ AGC TTc1 ^ GAG CTG^GC TGG CAG^lc 
LyS PlB Glu T -" T ^ v *l Ser PhP Arg Asp Leu Gly Trp Gin Asp 

??p uSES S CT 1f GGC TAC "« Gcc TAc1 ^ tgt gag'ggg 

Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly

GAG TGT^GCC TTC CCT^TG AAC TCc"J!2 ATG AAC^W ACC AAC^lc 
Glu cys Ala Phe Pro Leu Asn Ser Tyr MET Asn tit Thr ten SS 

A?a JES? Sf° GTC CAC1 ^ ATC GAA ACOTO 

Ala He Val Gin Thr Leu Val His Phe He Asn Pro He Ser Val 

Pro f^P* TGC CCC ACG^Ig CTC Aa/gSc ATC TCC^TC 

Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala lie Itr vll 

25 Tv^lll til f l CC ATC AAA TAC^Ia 

Leu Tyr Phe Asp Asp Ser Ser Asn- Val He Leu Lys Lys Tyr Arg 

1368 1377 1386 n .. 

ten m 5a? SIf f° G G ? C TGT GGC TGC CAC TAGCtSSSc 
Asn MET Val Val Arg Ala Cys Gly Cys His

(431) 

1409 1419 1429 1439

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGGTC 
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10 20 30 40 50 

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC 



60 70 80 90 100

GAGAGGTGGC GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG 



110 120 130 140 150 

GCCTCGCTCC GCCGCTCCAC GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC 

159 168 177 186 195 

CGGGCGGGG ATG CCG GGG CTG GGG CGG AGG GCG CAG TGG CTG TGC 
MET Pro Gly Leu Gly Arg Arg Ala Gln Trp Leu Cys
(1) 

204 213 222 231 240 

TGG TGG TGG GGG CTG CTG TGC AGC TGC TGC GGG CCC CCG CCG CTG 
Trp Trp Trp Gly Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu 

249 258 267 276 285 

CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC GCC GCC GGG GGG CAG 
Arg Pro Pro Leu Pro Ala Ala Ala Ala Ala Ala Ala Gly Gly Gln

294 303 312 321 330 

CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG GAG CAG CCG CCG 
Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr Glu Gln Pro Pro

339 348 357 366 375 

CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG CGG CTC AAG 
Pro Ser Pro Gln Ser Ser Ser Gly Phe Leu Tyr Arg Arg Leu Lys

384 393 402 411 420 

ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG GTG CTG 
Thr Gln Glu Lys Arg Glu MET Gln Lys Glu Ile Leu Ser Val Leu



429 438 447 456 465 

GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG CCG 
Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gln Gln Pro
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FIGURE 4B 



474 483 492 501 510 

CAG CCC CCG GCG CTC CGG CAG CAG GAG GAG CAG CAG CAG CAG CAG 
Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu Gln Gln Gln Gln Gln



519 528 537 546 555 

CAG CTG CCT CGC GGA GAG CCC CCT CCC GGG CGA CTG AAG TCC GCG 
Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg Leu Lys Ser Ala



564 573 582 591 600 

CCC CTC TTC ATG CTG GAT CTG TAC AAC GCC CTG TCC GCC GAC AAC 
Pro Leu Phe MET Leu Asp Leu Tyr Asn Ala Leu Ser Ala Asp Asn 



609 618 627 636 645 

GAC GAG GAC GGG GCG TCG GAG GGG GAG AGG CAG CAG TCC TGG CCC 
Asp Glu Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser Trp Pro



654 663 672 681 690

CAC GAA GCA GCC AGC TCG TCC CAG CGT CGG CAG CCG CCC CCG GGC
His Glu Ala Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Gly Ser



699 708 717 726 735 

GCC GCG CAC CCG CTC AAC CGC AAG AGC CTT CTG GCC CCC GGA TCT 
Pro Pro Gly Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala 



744 753 762 771 780 

GGC AGC GGC GGC GCG TCC CCA CTG ACC AGC GCG CAG GAC AGC GCC 
Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala



789 798 807 816 825

TTC CTC AAC GAC GCG GAC ATG GTC ATG AGC TTT GTG AAC CTG GTG
Phe Leu Asn Asp Ala Asp MET Val MET Ser Phe Val Asn Leu Val



834 843 852 861 870 

GAG TAC GAC AAG GAG TTC TCC CCT CGT CAG CGA CAC CAC AAA GAG 
Glu Tyr Asp Lys Glu Phe Ser Pro Arg Gln Arg His His Lys Glu



879 888 897 906 915

TTC AAG TTC AAC TTA TCC CAG ATT CCT GAG GGT GAG GTG GTG ACG
Phe Lys Phe Asn Leu Ser Gln Ile Pro Glu Gly Glu Val Val Thr

924 933 942 951 960

GCT GCA GAA TTC CGC ATC TAC AAG GAC TGT GTT ATG GGG AGT TTT
Ala Ala Glu Phe Arg Ile Tyr Lys Asp Cys Val MET Gly Ser Phe
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FIGURE 4C 



978 987 996 1005 

^ JAA ACT TTT CTT ATC AGC ATT TAT CAA GTC TTA CAG GAG 
Lys Asn Gln Thr Phe Leu Ile Ser Ile Tyr Gln Val Leu Gln Glu

1014 1023 1032 1041 1050 

S? C f 6 * GAC TCT GAC CTG TTT TTG TTG GAC ACC CGT GTA 
His Gln His Arg Asp Ser Asp Leu Phe Leu Leu Asp Thr Arg Val

1059 1068 1077 1086 1095

vll E5 G f C GAA GGC TGG CTG GAA TTT GAC ATC ACG GCC 

Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp Ile Thr Ala

1104 1113 1122 1131 1140 

A f ° AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG GGG CTT 
Thr Ser Asn Leu Trp Val Val Thr Pro Gln His Asn MET Gly Leu

1149 1158 1167 1176 1185

CAG CTG AGC GTG GTG ACA AGG GAT GGA GTC CAC GTC CAC CCC CGA
Gln Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro Arg

1194 1203 1212 1221 1230 

S?? 5?i 2? C ? TG GTG GGC AGA GAC CCT TAC GAT AAG CAG CCC 

Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gln Pro

1239 1248 1257 1266 1275

P^e Si vlt SET 5? C ^ C ?** GTG AGT GAG GTC OC GTG CGC ACC 
Phe MET Val Ala Phe Phe Lys Val Ser Glu Val His Val Arg Thr 

1284 1293 1302 1311 1320

ifS a™ G f C TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC 

Thr Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser Arg Asn Arg

(382) 

1329 1338 1347 1356 1365

22 £52 Sf G J CC ^ GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT 
(388) Asp Val Ala Arg Val Ser Ser Ala Ser Asp

1374 1383 1392 1401 1410 

Ztr ^ A6C AGT GAA raG AAA ACA GCC TGC AGG AAG CAT GAG CTG 
Tyr Asn Ser Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu

(412) 

1419 1428 1437 1446 1455

S? c GT S° CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA 
Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala
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FIGURE 4D 



1464 1473 1482 1491 1500

CCC AAG GGC TAT GCT GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC
Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe



1509 1518 1527 1536 1545 

CCA CTC AAC GCA CAC ATG AAT GCA ACC AAC CAC GCG ATT GTG CAG 
Pro Leu Asn Ala His MET Asn Ala Thr Asn His Ala Ile Val Gln



1554 1563 1572 1581 1590 

ACC TTG GTT CAC CTT ATG AAC CCC GAG TAT GTC CCC AAA CCG TGC 
Thr Leu Val His Leu MET Asn Pro Glu Tyr Val Pro Lys Pro Cys 

1599 1608 1617 1626 1635 

TGT GCG CCA ACT AAG CTA AAT GCC ATC TCG GTT CTT TAC TTT GAT 
Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp



1644 1653 1662 1671 1680 

GAC AAC TCC AAT GTC ATT CTG AAA AAA TAC AGG AAT ATG GTT GTA 
Asp Asn Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn MET Val Val



1689 1698 1708 1718 1728 

AGA GCT TGT GGA TGC CAC TAACTCGAAA CCAGATGCTG GGGACACACA 
Arg Ala Cys Gly Cys His 

(513) 



1738 1748 1758 1768 1778

TTCTGCCTTG GATTCCTAGA TTACATCTGC CTTAAAAAAA CACGGAAGCA

1788 1798 1808 1818 1828

CAGTTGGAGG TGGGACGATG AGACTTTGAA ACTATCTCAT GCCAGTGCCT

1838 1848 1858 1868 1878

TATTACCCAG GAAGATTTTA AAGGACCTCA TTAATAATTT GCTCACTTGG

1888 1898 1908 1918 1928

TAAATGACGT GAGTAGTTGT TGGTCTGTAG CAAGCTGAGT TTGGATGTCT

1938 1948 1958 1968 1978

GTAGCATAAG GTCTGGTAAC TGCAGAAACA TAACCGTGAA GCTCTTCCTA

1988 1998 2008 2018 2028

CCCTCCTCCC CCAAAAACCC ACCAAAATTA GTTTTAGCTG TAGATCAAGC

2038 2048 2058 2068 2078

TATTTGGGGT GTTTGTTAGT AAATAGGGAA AATAATCTCA AAGGAGTTAA

2088 2098 2108 2118 2128

ATGTATTCTT GGCTAAAGGA TCAGCTGGTT CAGTACTGTC TATCAAAGGT
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FIGURE 4E 



2138 2148 2158 2168 2178 

AGATTTTACA GAGAACAGAA ATCGGGGAAG TGGGGGGAAC GCCTCTGTTC 

2188 2198 2208 2218 2228 

AGTTCATTCC CAGAAGTCCA CAGGACGCAC AGCCCAGGCC ACAGCCAGGG 

2238 2248 2258 2268 2278 

CTCCACGGGG CGCCCTTGTC TCAGTCATTG CTGTTGTATG TTCGTGCTGG 

2288 2298 2308 2318 2328 

AGTTTTGTTG GTGTGAAAAT ACACTTATTT CAGCCAAAAC ATACCATTTC 

2338 2348 2358 2368 2378 

TACACCTCAA TCCTCCATTT GCTGTACTCT TTGCTAGTAC CAAAAGTAGA 

2388 2398 2408 2418 2428 

CTGATTACAC TGRGGTGAGG CTACAAGGGG TGTGTAACCG TGTAACACGT 

2438 2448 2458 2468 2478

GAAGGCAGTG CTCACCTCTT CTTTACCAGA ACGGTTCTTT GACCAGCACA 

2488 2498 2508 2518 2528

TTAACTTCTG GACTGCCGGC TCTAGTACCT TTTCAGTAAA GTGGTTCTCT 



2538 2548 2558 2568

GCCTTTTTAC TATACAGCAT ACCACgEca£ AGGGTTAGaI CCAACGA^gI

2588 2598 2608 2618

AAATAAAATG AGGGTGCCCA GCTTATAAGA ATGGTGTTAG GGGGATGAGC

2638 2648 2658 2668 2678

ATGCTGTTTA TGAACGGAAA TCATGATTTC CCTGTAGAAA GTGAGGCTCA 

2688 2698 2708 2718 2728 

GATTAAATTT TAGAATATTT TCTAAATGTC TTTTTCACAA TCATGTGACT 



2738 2748 2758 2768 2778 

GGGAAGGCAA TTTCATACTA AACTGATTAA ATAATACATT TATAATCTAC 

2788 2798 2808 2818 2828 

AACTGTTTGC ACTTACAGCT TTTTTTGTAA ATATAAACTA TAATTTATTG 

2838 2848 2858 2868 2878 

TCTATTTTAT ATCTGTTTTG CTGTGGCGTT GGGGGGGGGG CCGGGCTTTT 

2888 2898 2908 2918 

GGGGGGGGGG GTTTGTTTGG GGGGTGTCGT GGTGTGGGCG GGCGG 
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FIGURE 5A 



10 20 30 40 50 

CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA 

60 70 80 90 100 

GGGATTGAAT GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC 

110 120 130 140 150 

ATTTACTTGA ATAGTACAAC CTAGAGTATT ATTTTACACT AAGACGACAC 

160 170 180 190 200 

AAAAGATGTT AAAGTTATCA CCAAGCTGCC GGACAGATAT ATATTCCAAC 

210 220 230 240 250 

ACCAAGGTGC AGATCAGCAT AGATCTGTGA TTCAGAAATC AGGATTTGTT 

260 270 280 290 300 

TTGGAAAGAG CTCA^GGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 

310 320 330 340 350 

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC 

360 370 380 390 400 

AAAGGCCTGA TTATCATAAA TTCATATAGG AATGCATAGG TCATCTGATC 

410 420 430 440 450 

AAATAATATT AGCCGTCTTC TGCTACATCA ATGCAGCAAA AACTCTTAAC 

460 470 480 490 500 

AACTGTGGAT AATTGGAAAT CTGAGTTTCA GCTTTCTTAG AAATAACTAC 

510 520 530 540 550 

TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA TCGGTGAGGA 

560 570 580 590 600 

TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 

610 620 630 640 650 

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT TTTTTTTTTT 

660 670 680 690 700 

AAGAGGACAA GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA
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FIGURE 5B



701 710 719 728 737 

ATG CAT CTG ACT GTA TTT TTA CTT AAG GGT ATT GTG GGT TTC CTC 

MET His Leu Thr Val Phe Leu Leu Lys Gly Ile Val Gly Phe Leu

(1) 

746 755 764 773 782 

TGG AGC TGC TGG GTT CTA GTG GGT TAT GCA AAA GGA GGT TTG GGA 

Trp Ser Cys Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly 

791 800 809 818 827 

GAC AAT CAT GTT CAC TCC AGT TTT ATT TAT AGA AGA CTA CGG AAC 

Asp Asn His Val His Ser Ser Phe Ile Tyr Arg Arg Leu Arg Asn

836 845 854 863 872 

CAC GAA AGA CGG GAA ATA CAA AGG GAA ATT CTC TCT ATC TTG GGT 

His Glu Arg Arg Glu Ile Gln Arg Glu Ile Leu Ser Ile Leu Gly

881 890 899 908 917 

TTG CCT CAC AGA CCC AGA CCA TTT TCA CCT GGA AAA CAA GCG TCC

Leu Pro His Arg Pro Arg Pro Phe Ser Pro Gly Lys Gln Ala Ser

926 935 944 953 962

TCT GCA CCT CTC TTT ATG CTG GAT CTC TAC AAT GCC ATG ACC AAT

Ser Ala Pro Leu Phe MET Leu Asp Leu Tyr Asn Ala MET Thr Asn

971 980 989 998 1007 

GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA AGG GCA TCC TTG 

Glu Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val Arg Ala Ser Leu 

1016 1025 1034 1043 1052 

GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA GCC TCT CCC 
Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro Ala Ser Pro 

1061 1070 1079 1088 1097 

AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT CCT CTG 
Asn Gly Tyr Pro Arg Arg Ile Gln Leu Ser Arg Thr Thr Pro Leu

1106 1115 1124 1133 1142 

ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC TTT 
Thr Thr Gln Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn Phe

1151 1160 1169 1178 1187 

CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA 
Leu Asn Asp Ala Asp MET Val MET Ser Phe Val Asn Leu Val Glu 

1196 1205 1214 1223 1232 

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT 
Arg Asp Lys Asp Phe Ser His Gln Arg Arg His Tyr Lys Glu Phe
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FIGURE 5C 



1241 1250 1259 1268 1277 

CGA TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA 
Arg Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala Val Thr Ala

1286 1295 1304 1313 1322 

GCT GAA TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA 
Ala Glu Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu

1331 1340 1349 1358 1367 

AAT GAA ACA ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC 
Asn Glu Thr Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu Tyr

1376 1385 1394 1403 1412 

ACA AAT AGG GAT GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC 
Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala 

1421 1430 1439 1448 1457 

CAA GCT TTA GAT GTG GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC 
Gln Ala Leu Asp Val Gly Trp Leu Val Phe Asp Ile Thr Val Thr

1466 1475 1484 1493 1502

AGC AAT CAT TGG GTG ATT AAT CCC CAG AAT AAT TTG GGC TTA CAG
Ser Asn His Trp Val Ile Asn Pro Gln Asn Asn Leu Gly Leu Gln

1511 1520 1529 1538 1547 

CTC TGT GCA GAA ACA GGG GAT GGA CGC AGT ATC AAC GTA AAA TCT 
Leu Cys Ala Glu Thr Gly Asp Gly Arg Ser Ile Asn Val Lys Ser

1556 1565 1574 1583 1592 

GCT GGT CTT GTG GGA AGA CAG GGA CCT CAG TCA AAA CAA CCA TTC 
Ala Gly Leu Val Gly Arg Gln Gly Pro Gln Ser Lys Gln Pro Phe

1601 1610 1619 1628 1637 

ATG GTG GCC TTC TTC AAG GCG AGT GAG GTA CTT CTT CGA TCC GTG 
MET Val Ala Phe Phe Lys Ala Ser Glu Val Leu Leu Arg Ser Val 

1646 1655 1664 1673 1682 

AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC CGC AAT AAA TCC AGC 
Arg Ala Ala Asn Lys Arg Lys Asn Gln Asn Arg Asn Lys Ser Ser

(329) 

1691 1700 1709 1718 1727 

TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT GGA GAT TAT AAC 
Ser His Gln Asp Ser Ser Arg MET Ser Ser Val Gly Asp Tyr Asn

(337) 




FIGURE 5D 



1736 1745 1754 1763 1772 

ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA CTC TAT GTG 
Thr Ser Glu Gln Lys Gln Ala Cys Lys Lys His Glu Leu Tyr Val

(356) 

1781 1790 1799 1808 1817 

AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA CCA GAA 
Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu
(362) 

1826 1835 1844 1853 1862 

GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA CTT 
Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu 

1871 1880 1889 1898 1907 

AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG 
Asn Ala His MET Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu

1916 1925 1934 1943 1952 

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT 
Val His Leu MET Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala 

1961 1970 1979 1988 1997

CCA ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC
Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser

2006 2015 2024 2033 2042 

TCC AAT GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA 
Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn MET Val Val Arg Ser



2051 2060 2070 2080 2090 2100 

TGT GGC TGC CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT 
Cys Gly Cys His 
(454) 

2110 2120 2130 2140 2150 

TAAGGTTTAT GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 
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Figure 6 

(1) 

GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC CCG GCT 
Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro Ala

(10) 

GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG CCC AGC ATC CAC 
Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val Pro Ser Ile His
(20) (30) 

CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GTG GTC CAG GAG CAG TCC 
Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln Val Val Gln Glu Gln Ser

(40) (50) 

AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA GCT GGA GAC 
Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gln Thr Leu Arg Ala Gly Asp

(60) (70) 

GAG GGC TGG CTG GTG CTG GAT GTC ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG 
Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys

(80) 

CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC 
Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser
(90) (100) 

GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG 
Val Asp Pro Gly Leu Ala Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln
(110) (120)

CCT TTC GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG
Pro Phe Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg

(130) (140) 

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG 
Ala Val Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu Pro Gln

(150) (160) 

GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG 
Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser His Gly Arg Gln

(170) 

GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTT GGC TGG CTG GAC 
Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Leu Asp
(180) (190) 

TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC TAT TAC TGT GAG GGG GAG TGC TCC 
Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser
(200) (210) 

TTC CCG CTG GAC TCC TGC ATG AAC GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG 
Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala Ile Leu Gln Ser Leu

(220) (230) 
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GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG 
Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys 

(240) (250) 

CTG AGC GCC ACC TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC ATC CTG CGC 
Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg

(260) 

AAG CAC CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCCCGCCCAGC 
Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
(270) (280) 

CCTACTGCAGCCACCCTTCTCATCTGGATCGGGCCCTGCAGAGGCAGAAAACCCTTAAATGCTGTCACAG 
CTCAAGCAGGAGTGTCAGGGGCCCTCACTCTCGGTGCCTACTTCCTGTCAGGCTTCTGGGAATTC 
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FIGURE 7 

OACCAAAOOO CCTCOTGATA COCCTATTW TATAOQTTAA TGTCATGATA ATAATOOTTT 60 

CTTAOACGTC AOOTQQCACT TTTCOOCCAA ATOTGCOCOO AACCCOTATt TOTTTAITXT 120 

TCTAAATACA TTCAAATATO TATCCOCTCA TOAOACAATA ACOOTGATAA ATOCITCAAt 180 

AATATTOAAA AAOOAAOACI ATOAOTATTC AACATITCOC TOTOQCCCTT ATTCCCTTTT 240 

WOCOOCAtt TTOCCITCCT CTTTTTOCTC ACCCASAAAC OCICOTOAAA GXAAAAOAtO 300 

CTOAAQATCA 0TTCC0T0CA COAGTOOOTt ACATCCAACT OOATCTCAAC AOCOOTAAGA 360 

TCCTTGAGAO TTTTCOCCOC GAAOAACGT? TTCCAATCAT GAOCACTITT AAAOTTCT6C 420 

IATOTOOCOC OOTATTATCC CGTATTOACG CCMOCAAGA OCAACTOOOT OCCOOCAtAC 480 

ACTATTOTOA OAATOACTTO OTIOACIACT CACCAOTCAC AQAAAAGCAT CTTACOOATO 540

OOATOACACT AAOAOAAVTA TOCAOTGCTO OQATAACCAT OAOTOATAAC ACTOCGOCCA 600 

ACTTACTTCT 0ACAA06ATC OOAOOACOOA ACOAOCTAAC COCTTTTTTO CACAACATOO 660 

C80ATCAT0T AACTCCCCTT OATCGTTOOO AACCOOAOC* OAATOAAOCC ATACCAAA09 720 

ACCAGCOTOA CACCACOASO CCTOTACCAA TOGCAACAAC OTTOCOCAAA CIATTAACTO 780 

GCOAACTACT TACTCTA6CT ICCCCCCAAC AATTAATAOA ©WOATOGAC OCOOATAAAO 840 

TTQCAOOACG ACTTCTOCGC TCOOCCCTTC COOCTCGCTO GTT2ATT0C? OATAAATCTO 900 

OACCCCQTOA OCCTOOOICT CCCGOIATCA TTOCAOCACT OOOGCCAOAT OOTAAGCCC* 960 

CCCOTATOBT AOTTATCTAC ACOACCOOOA GTCAOOCAAC TATOOATOAA COAAATAGAC 1020 

AQA7COCT0A OATABCTOCC TCACMATTA AOCAWOOTA ACTOTCAOAC CAAOTTTACT 1080 

CATATATACI TTAGATTOAT TTAAAACTTC ATTTTTAATT TAAAAOOATC TAG0T0AA6A 1140 

TCCWTTTOA TAATCTOATO ACOAAAATCC CTTAACOTOA CTTTTOOTTC CACTOAGCOT 1200 

CAOACCCCOT AOAAAAOATC AAAOOATGT? CTTOAOATCC TTTTTTTCTG CCCOTAATCT 1260 

OCTGCTWCA AACAAAAAAA CCACCOCTAC CAOCOOTOCT TTCTTTOCCO OATCAAOAOO 1320 

TAOCAACTCT TTTTCCOAAO OTAACTOOCt TCAOCA0A0C GCACATACCA AATAOTOTCC 1380 

TTCTAflTOTA OCCOTAOTTA OOCCAeCACT TCAAGAACTC TOIAOCAOOO CCTACATACC 1440 

TCOCTCTOCT AATOCTOTTA CCAOTOOCTO CTOCCAOWO COAIAACTCO TOTCKACCO 1500 

OOTTGOACTC AAOACOATAO TTACCOOATA JWOCOCAOCe OTCGGGCTOA ACGOCGOOTT 1560 

COTOCAOACA OCCCAOCtW OAOeOAACOA OOTAOACCOA ACTOAOATAC CTACAOCOTO 1620 

AGCAMOAGA AA0C0CC!AC6 CTTCOCCAAO OOAOAAAOOC GOACAOOTAT CCOOTAACCG 1680 

OCAOCGTCOO AACAOOAOAO CGCACOAOOC ACCTTCCABO OOOAAACOCC TQGTATCTTT 1740 

ATAGTCCTOI CGCCTTTCOC CACCTCTOAC TTOAGCGTCO ATTTTTOTOA TOCTCOTCAG 1800 

0OO0GC60A0 CCTAtCOAAA AACCCCAOCA AOOCOOCOTT WTACGCTTC CTCCCCTTTT 1860

GCTOOCCTTT TOCTCACATO TTCTTTCCTO CCMATCCCC TCATTCTGTO OATAACCOTA 1920 
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FIGURE 7 (cont'd) 



TTACOCCCTT TCAOTOAGCT GATACCCCTC GCCCCAGCOG AACGACCGAG CGCACOGAGT 1980

CAGTGACCGA GGAAGCOOAA GAGCGGCCAA 7ACOCAAACC GCCTCTCCCC OCGOOTTGCC 2040

COATTCATTA ATOCAOAATT GATCTCTCAC C7ACCAAACA ATGCCCCCCT CCAAAAAATA 2100

AATTCATATA AAAAACATAC AGATAACCA7 GTGCGQTOAT AAATTATCTC TGGCGGTGTt 2160

OACATAAATA CCACTOGCGC TGATACTGAO CACATGAGCA GGACGCACTG ACCACCATGA 2220

AGGTOAOCCT CTTAAAAATT AAGCCCTOAA GAAGOOCAGC ATTCAAAGCA GAAOGCTTTG 2280

GOOTGTCTOA TACGAAACOA AGGATTOGCC OTAAGTGCGA TTCCGGATTA OCTGCCAATO 2340

TCCCAATCGC OOOCGGTTTT OCTTCAOGAC TACAACTGCC ACACACCACC AAAGCTAACT 2400

OACAGOAOAA TCCAOATGGA TGCACAAAGA CGCOCCCGOG AACGTCCOGC AGAOAAACAG 2460

OCTCAATOGA AACCAGCAAA TCCCCTGTTO OTTGGGGTAA GOGGAAAACC AG77COGAAA 2520

OATTTTTTTA ACTATAAACO CTGATGOAAG OOTTTATGOG GAAGAGCTAA AGCCCTTCCC 2580

GAGTAACAAA AAAACAACAO dXTAAATAAC CCCGCTCTTA CACATTCCAO CCCTGAAAAA 2640

GGGCATCAAA TTAAACCACA CCTATGGTGT A7CCATTTAT TTGCATACAT TCAATOAATT 2700

OTTATCTAAO OAAATACTTA CATATOCAAO CTAAACATAA ACAACGTAAA CGTCTGAAAT 2760

CTAGCTGTAA OACACACCCT TX0TA00TG6 ACTTCAGTOA CGTGGGGTGG AA7GACTGGA 2820

TTGTGGCTCC CCCCOGOTAT CACOCCTTTT ACTQOCAOGO AGAATGCCCT TTTCCTCTGG 2880

CTOATCATCT GAACTCCACT AATCATOCCA TTGTTCAOAC GTTCGXCAAC TCTGTTAACT 2940

CTAAGATTCC TAACGCATOC TOTGTCCOCA CAGAACTCAO TOCTATCTCG ATGCTOTACC 3000

nOAOQAGAA TOAAAAGQTT OTATtAAAGA ACtATCAOGA GATGOTTGTO GAGGGT70TG 3060

GGTOTCOCTA OTACAGGAAA ATTAAATACA TAAATATATA TATATATATA tattttaoaa 3120

AAAAQAAAAA AATCTAflAOT CGACCTGCAG TAATCGTACA G0G7AGTACA aataaaaaag 3180

GCAOCTCAOA TOACGTGCCT TTTTTCTTOT GAGCAGTAAG CTTOCCACTG GCCOTCGTTT 3240

TACAACOTCO TGACTOGGAA AACCCTGOOO TTAOCCAACT TAAtCOCCTT GCAGCACATC 3300

CCCCTTTCOC CAGCTCGCOT AATAGCGAAG AGGCCOOCAC COATCGCCCT TCCCAACAOT 3360

TGCGCAOCCT OAATOOOOAA TOGCGOOTGA TGOOGTATTT TCTCCTTACG CATCTOTGOO 3420

GTATTTCACA CCOCATATAT GOTGCACTCT CAOTACAATC TOCTCTOATG CCQCATAGT? 3480

AAOCCAOCCC CGAGACCCOC CAACACCCOC TGACGCOCCC TOAOOGGCM OTCTGCTCCC 3540

OOCATCCOCT TACAGACAAG CTGTGACCGT CTOCGGGAOC TGCATGTG7C AGAOGTTTTC 3600

ACCGTCATCA CCGAAACGCG CGA 3623
FIGURE 8

W-20 ALKALINE PHOSPHATASE: BMP-2 VS. BMP-2/7

[Plot not reproduced. X-axis: BMP (ng/ml), logarithmic scale, 0.001 to 1000.]


FIGURE 9

EFFECTS OF BMP-2 AND BMP-2/7 ON BGP SYNTHESIS BY W-20 CELLS

[Plot not reproduced. X-axis: BMP (ng/ml).]
FIGURE 10

COMPARISON OF E. coli BMP-2 AND BMP-2/7: W-20-17 ALKALINE PHOSPHATASE

[Plot not reproduced. X-axis ticks: 0.010 to 1000.000, logarithmic scale.]
FIGURE 11A



10 20 30 40 50 60 70 

AGATCTTGAA AACACCCGGG CCACACACGC CGCGACCTAC AGCTCITrCT CAGCGTTGGA GTCGAGACGG 

80 90 100 110 120 130 140 

OGOCCGCAGC GCCCTGCGCG GGTCAGGTCC GOGCAGCTGC TGGGGAAGAG CXXACCTGTC AGGCTGOGCT 

150 160 170 180 190 200 210 

GGGTCAGCGC AGCAAGTGGG GCIGG003CT ATC3CGCTGC A0003GO03C GTCCOGGGCT COJIGCGCCC 

220 230 240 250 260 270 280 

TCGCOCCAGC TGGTTTGGAG TTCAAOOCTC GGCTCCGCCG OOQGCTOCTr GO30CITO3G AGTGTOCCGC 

290 300 310 320 (1) 335

AGOSAOSCCG GGAGCCGACG CGCCGOGCGG GTAOCTAGOC ATG GCT GGG GCG AGC AGG CTG CTC

MET Ala Gly Ala Ser Arg Leu Leu

350 365 380 395

TTT CTG TGG CTG GGC TGC TTC TGC GTG AGC CTG GCG CAG GGA GAG AGA CCG AAG CCA
Phe Leu Trp Leu Gly Cys Phe Cys Val Ser Leu Ala Gln Gly Glu Arg Pro Lys Pro

410 425 440 455

CCT TTC CCG GAG CTC CGC AAA GCT GTG CCA GGT GAC CGC ACG GCA GGT GGT GGC CCG
Pro Phe Pro Glu Leu Arg Lys Ala Val Pro Gly Asp Arg Thr Ala Gly Gly Gly Pro

470 485 500 515

GAC TCC GAG CTG CAG CCG CAA GAC AAG GTC TCP GAA CAC ATG CTG CGG CTC TAT GAC
Asp Ser Glu Leu Gln Pro Gln Asp Lys Val Ser Glu His MET Leu Arg Leu Tyr Asp

530 545 560

AGG TAC AGC ACG GTC CAG GCG GCC CGG ACA CCG GGC TCC CTG GAG GGA GGC TCG CAG
Arg Tyr Ser Thr Val Gln Ala Ala Arg Thr Pro Gly Ser Leu Glu Gly Gly Ser Gln

575 590 605 620

CGC TGG CGC CCT CGG CTC CTG CGC GAA GGC AAC ACG GTT CGC AGC TTT CGG GCG GCA
Pro Trp Arg Pro Arg Leu Leu Arg Glu Gly Asn Thr Val Arg Ser Phe Arg Ala Ala

635 650 665 680

GCA GCA GAA ACT CTT GAA AGA AAA GGA CTG TAT ATC TTC AAT CTG ACA TCG CTA ACC
Ala Ala Glu Thr Leu Glu Arg Lys Gly Leu Tyr Ile Phe Asn Leu Thr Ser Leu Thr

695 710 725 740

AAG TCT GAA AAC ATT TTG TCT GCC ACA CTG TAT TTC TGT ATT GGA GAG CTA GGA AAC
Lys Ser Glu Asn Ile Leu Ser Ala Thr Leu Tyr Phe Cys Ile Gly Glu Leu Gly Asn
FIGURE 11C

1430 1445 (377) 1460 1475

TGC GCC AGG AGA TAC CTC AAG GTA GAC TTT GCA GAT ATT GGC TGG ACT GAA TGG ATT
Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp Ile Gly Trp Ser Glu Trp Ile

1490 1505 1520 1535

ATC TCC CCX AAG TCC TTT GAT GCC TAT TAT TGC TCP GGA GCA TGC CAG TTC CCC ATG
Ile Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser Gly Ala Cys Gln Phe Pro MET

1550 1565 1580 1595

CCA AAG TCT TTG AAG CCA TCA AAT CAT GCT ACC ATC CAG ACT ATA GTG AGA GCT GTG
Pro Lys Ser Leu Lys Pro Ser Asn His Ala Thr Ile Gln Ser Ile Val Arg Ala Val

1610 1625 1640 1655

GGG GTC GTT CCT GGG ATT CCT GAG CCT TGC TGT GTA CCA GAA AAG ATG TCC TCA CTC
Gly Val Val Pro Gly Ile Pro Glu Pro Cys Cys Val Pro Glu Lys MET Ser Ser Leu

1670 1685 1700

ACT ATT TTA TTC TTT GAT GAA AAT AAG AAT GTA GTG CTT AAA GTA TAC CCT AAC ATG
Ser Ile Leu Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn MET

1715 1730 (472) 1746 1756 1766 1776

ACA GTA GAG TCT TGC GCT TGC AGA TAAOCTGGCA AAGAACTCAT TTGAATGCTT AATTCAATCT
Thr Val Glu Ser Cys Ala Cys Arg

1786

CTAGAGTCGA CGGAATTC
Figure 12

W-20 ALKALINE PHOSPHATASE: CHO BMP-2/6 vs. CHO BMP-2

[Plot not reproduced. X-axis ticks: 0.1 to 1000.0, logarithmic scale.]
FIGURE 13A

CARTILAGE
10 DAY IMPLANTS

[Plot not reproduced. Legend: BMP-2, BMP-2/6. X-axis: µg BMP.]


FIGURE 13B 
FIGURE 14A

FIGURE 14B